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0. Introduction 


Solid state physics is the study of “stuff”, of how the wonderfully diverse properties of 
solids can emerge from the simple laws that govern electrons and atoms. 


There is one, over-riding, practical reason for wanting to understand the behaviour 
of stuff: this is how we build things. In particular, it is how we build the delicate 
and powerful technologies that underlie our society. Important though they are, such 
practicalities will take a back seat in our story. Instead, our mantra is “knowledge 
for its own sake”. Indeed, the subject of solid state physics turns out to be one of 
extraordinary subtlety and beauty. If such knowledge ultimately proves useful, this is 
merely a happy corollary. 


We will develop only the basics of solid state physics. We will learn how electrons 
glide through seemingly impenetrable solids, how their collective motion is described by 
a Fermi surface, and how the vibrations of the underlying atoms get tied into bundles 
of energy known as phonons. We will learn that electrons in magnetic fields can do 
strange things and start to explore some of the roles that geometry and topology play 
in quantum physics. 


One of the ultimate surprises of solid state physics is how the subject later dovetails 
with ideas from particle physics. At first glance, one might have thought these two 
disciplines should have nothing to do with each other. Yet one of the most striking 
themes in modern physics is how ideas from one have influenced the other. In large 
part this is because both subjects rest on some of the deepest principles in physics: 
ideas such symmetry, topology and universality. Although much of what we cover in 
these lectures will be at a basic level, we will nonetheless see some hints of these deeper 
connections. We will, for example, see the Dirac equation — originally introduced to 
unify relativity and quantum mechanics — emerging from graphene. We will learn how 
the vibrations of a lattice, and the resulting phonons, provide a baby introduction to 
quantum field theory. 


1. Particles in a Magnetic Field 


The purpose of this chapter is to understand how quantum particles react to magnetic 
fields. In contrast to later sections, we will not yet place these particles inside solids, 
for the simple reason that there is plenty of interesting behaviour to discover before we 
do this. Later, in Section 3.1, we will understand how these magnetic fields affect the 
electrons in solids. 


Before we get to describe quantum effects, we first need to highlight a few of the 
more subtle aspects that arise when discussing classical physics in the presence of a 
magnetic field. 


1.1 Gauge Fields 


Recall from our lectures on Electromagnetism that the electric field E(x, t) and mag- 
netic field B(x, t) can be written in terms a scalar potential ¢(x, t) and a vector potential 
A(x, t), 


B=-vo- = and B=VxA (1.1) 


Both ¢ and A are referred to as gauge fields. When we first learn electromagnetism, they 
are introduced merely as handy tricks to help solve the Maxwell equations. However, 
as we proceed through theoretical physics, we learn that they play a more fundamental 
role. In particular, they are necessary if we want to discuss a Lagrangian or Hamiltonian 
approach to electromagnetism. We will soon see that these gauge fields are quite 
indispensable in quantum mechanics. 


The Lagrangian for a particle of charge q and mass m moving in a background 
electromagnetic fields is 


L= imi? + qx- A — a6 (1.2) 
The classical equation of motion arising from this Lagrangian is 
mx = q(E+x x B) 
This is the Lorentz force law. 


Before we proceed I should warn you of a minus sign issue. We will work with 
a general charge g. However, many textbooks work with the charge of the electron, 
written as q = —e. If this minus sign leans to confusion, you should blame Benjamin 
Franklin. 


An Example: Motion in a Constant Magnetic Field 


We’ll take a constant magnetic field, pointing in the z-direction: B = (0,0, B). We’ll 
take E = 0. The particle is free in the z-direction, with the equation of motion mz = 0. 
The more interesting dynamics takes place in the (x, y)-plane where the equations of 
motion are 


mi=qBy and my = —qBt (1.3) 
which has general solution is 
a(t) = X + Rsin(wg(t — to)) and y(t) =Y + Reos(wa(t — to) 


We see that the particle moves in a circle which, for B > 0 B 
and q > 0, is in a clockwise direction. The cyclotron 

frequency is defined by 

_ 4B 
om 


WB (1.4) 


Figure 1: 
The centre of the circle (X,Y), the radius of the circle R 


and the phase to are all arbitrary. These are the four integration constants expected in 
the solution of two, second order differential equations. 


1.1.1 The Hamiltonian 


The canonical momentum in the presence of gauge fields is 
OL 
P= 5 = mxt4q (1.5) 
This clearly is not the same as what we naively call momentum, namely mx. 


The Hamiltonian is given by 


1 
H=x-p—L=5—(p—qgA)’ +a¢ 
mm 


Written in terms of the velocity of the particle, the Hamiltonian looks the same as 
it would in the absence of a magnetic field: H = smx? + qd. This is the statement 
that a magnetic field does no work and so doesn’t change the energy of the system. 
However, there’s more to the Hamiltonian framework than just the value of H. We 
need to remember which variables are canonical. This information is encoded in the 
Poisson bracket structure of the theory (or, in fancy language, the symplectic structure 
on phase space). The fact that x and p are canonical means that 


{xi pj} =ð; with 42495) = {pi pj} =0 


In the quantum theory, this structure transferred onto commutation relations between 
operators, which become 


leg pj] = thoi; with [ees 25 = [pi, pj] = 0 


1.1.2 Gauge Transformations 


The gauge fields A and ¢ are not unique. We can change them as 
Oa 
Prge r and A> A+Va (1.6) 


for any function a(x, t). Under these transformations, the electric and magnetic fields 
(1.1) remain unchanged. The Lagrangian (1.2) changes by a total derivative, but this is 
sufficient to ensure that the resulting equations of motion (1.3) are unchanged. Different 
choices of a are said to be different choices of gauge. We’ll see some examples below. 


The existence of gauge transformations is a redundancy in our description of the 
system: fields which differ by the transformation (1.6) describe physically identical 
configurations. Nothing that we can physically measure can depend on our choice of 
gauge. This, it turns out, is a beautifully subtle and powerful restriction. We will start 
to explore some of these subtleties in Sections 1.3 and 1.4 


The canonical momentum p defined in (1.5) is not gauge invariant: it transforms 
as p > p+qVa. This means that the numerical value of p can’t have any physical 
meaning since it depends on our choice of gauge. In contrast, the velocity of the particle 
x is gauge invariant, and therefore physical. 


The Schrödinger Equation 


Finally, we can turn to the quantum theory. We’ll look at the spectrum in the next 
section, but first we wish to understand how gauge transformations work. Following 
the usual quantisation procedure, we replace the canonical momentum with 


pro —-ihV 
The time-dependent Schrodinger equation for a particle in an electric and magnetic 


field then takes the form 


a 1 
n soe =~ (iV = aA) Y + qoy (1.7) 


The shift of the kinetic term to incorporate the vector potential A is sometimes referred 
to as minimal coupling. 


Before we solve for the spectrum, there are two lessons to take away. The first is that 
it is not possible to formulate the quantum mechanics of particles moving in electric 
and magnetic fields in terms of E and B alone. We’re obliged to introduce the gauge 
fields A and ¢. This might make you wonder if, perhaps, there is more to A and @ 
than we first thought. We’ll see the answer to this question in Section 1.3. (Spoiler: 
the answer is yes.) 


The second lesson follows from looking at how (1.7) fares under gauge transforma- 
tions. It is simple to check that the Schrödinger equation transforms covariantly (i.e. 
in a nice way) only if the wavefunction itself also transforms with a position-dependent 
phase 


(x,t) > elt) Pah (xx, t) (1.8) 


This is closely related to the fact that p is not gauge invariant in the presence of a mag- 
netic field. Importantly, this gauge transformation does not affect physical probabilities 
which are given by |q|?. 


The simplest way to see that the Schrodinger equation transforms nicely under the 
gauge transformation (1.8) is to define the covariant derivatives 


Pd 


D = Ee D; = 5 i 
: 78 ane Ox' h 


In terms of these covariant derivatives, the Schrodinger equation becomes 
Rh 9 
iADwy = ——D*w (1.9) 
2m 


But these covariant derivatives are designed to transform nicely under a gauge trans- 
formation (1.6) and (1.8). You can check that they pick up only a phase 


Div > "Dib and Dy) > e Dib 
This ensures that the Schrödinger equation (1.9) transforms covariantly. 


1.2 Landau Levels 


Our task now is to solve for the spectrum and wavefunctions of the Schrodinger equa- 
tion. We are interested in the situation with vanishing electric field, E = 0, and 
constant magnetic field. The quantum Hamiltonian is 


H = — (p - 44} (1.10) 


We take the magnetic field to lie in the z-direction, so that B = (0,0, B). To proceed, 
we need to find a gauge potential A which obeys V x A = B. There is, of course, no 
unique choice. Here we pick 


A = (0, xB,0) (1.11) 


This is called Landau gauge. Note that the magnetic field B = (0,0, B) is invariant 
under both translational symmetry and rotational symmetry in the (x, y)-plane. How- 
ever, the choice of A is not; it breaks translational symmetry in the x direction (but not 
in the y direction) and rotational symmetry. This means that, while the physics will 
be invariant under all symmetries, the intermediate calculations will not be manifestly 
invariant. This kind of compromise is typical when dealing with magnetic field. 


The Hamiltonian (1.10) becomes 


1 2 2 2 


Because we have manifest translational invariance in the y and z directions, we have 
[py, H] = |p, H] = 0 and can look for energy eigenstates which are also eigenstates of 
py and pz. This motivates the ansatz 


p(x) = eH y (x) (1.12) 


Acting on this wavefunction with the momentum operators py = —ihd, and p, = —thdz, 
we have 


Py = ħkyy and p4 = hk,w 


The time-independent Schrödinger equation is Hy = Ew. Substituting our ansatz 
(1.12) simply replaces p, and p, with their eigenvalues, and we have 


1 
Hy(x) = 5—|p2 + (Aky — gBa)? + RR |Y) = v(x) 
We can write this as an eigenvalue equation for the equation x(x). We have 


Axa) = (B= F) x(a) 


where H is something very familiar: it’s the Hamiltonian for a harmonic oscillator in 


the x direction, with the centre displaced from the origin, 


1 > muĝ 
2 


Ĥ = (£ — kyl2,)? (1.13) 


The frequency of the harmonic oscillator is again the cyloctron frequency wg = qB/m, 
and we’ve also introduced a length scale lg. This is a characteristic length scale which 
governs any quantum phenomena in a magnetic field. It is called the magnetic length. 


| h 
=p 


To give you some sense for this, in a magnetic field of B = 1 Tesla, the magnetic length 
for an electron is lp ~ 2.5 x 1078 m. 


Something rather strange has happened in the Hamiltonian (1.13): the momentum 
in the y direction, Aky, has turned into the position of the harmonic oscillator in the x 
direction, which is now centred at x = kyl%. 


We can immediately write down the energy eigenvalues of (1.13); they are simply 
those of the harmonic oscillator 


1 A? k? 
E = hw = = = 01,2 1.14 
B ( F 3 F Im n ak aet ( ) 
The wavefunctions depend on three quantum numbers, n € N and ky, kz E€ R. They 
are 


Yn pla, y) ~ eiut H (g — kylh)e (0*8) 2b (1.15) 


with H,, the usual Hermite polynomial wavefunctions of the harmonic oscillator. The ~ 
reflects the fact that we have made no attempt to normalise these these wavefunctions. 


The wavefunctions look like strips, extended in the y direction but exponentially 
localised around x = k,l? in the x direction. However, you shouldn’t read too much 
into this. As we will see shortly, there is large degeneracy of wavefunctions and by 
taking linear combinations of these states we can cook up wavefunctions that have 
pretty much any shape you like. 


1.2.1 Degeneracy 


The dynamics of the particle in the z-direction is unaffected by the magnetic field 
B = (0,0, B). To focus on the novel physics, let’s restrict to particles with k, = 0. The 
energy spectrum then coincides with that of a harmonic oscillator, 


Ey = hwp (r + 5) (1.16) 


In the present context, these are called Landau levels. We E 
see that, in the presence of a magnetic field, the energy levels 5 


of a particle become equally spaced, with the gap between n=4 


each level proportional to the magnetic field B. Note that T 
the energy spectrum looks very different from a free particle 
moving in the (x, y)-plane. ash 


k 
The states in a given Landau level are not unique. In- 


stead, there is a huge degeneracy, with many states having Figure 2: Landau Levels 
the same energy. We can see this in the form of the wave- 

functions (1.15) which, when k, = 0, depend on two quantum numbers, n and ky. Yet 
the energy (1.16) is independent of ky. 


Let’s determine how large this degeneracy of states is. To do so, we need to restrict 
ourselves to a finite region of the (x, y)-plane. We pick a rectangle with sides of lengths 
L, and Ly. We want to know how many states fit inside this rectangle. 


Having a finite size L, is like putting the system in a box in the y-direction. The 
wavefunctions must obey 


P(a,ytLy,2)=V(a,y,z) = ey = 1 
This means that the momentum ky is quantised in units of 27/Ly,. 


Having a finite size Lẹ is somewhat more subtle. The reason is that, as we mentioned 
above, the gauge choice (1.11) does not have manifest translational invariance in the 
x-direction. This means that our argument will be a little heuristic. Because the 
wavefunctions (1.15) are exponentially localised around x = kyl%, for a finite sample 
restricted to 0 < x < Ly we would expect the allowed k, values to range between 
0< ky < Lell. The end result is that the number of states in each Landau level is 
given by 


N 


A i ap — Lalu _ IBA 
0 


_Ly = M 11 
Qn 2rl? Qrh (1.17) 


where A = L,Ly is the area of the sample. Strictly speaking, we should take the integer 
part of the answer above. 


The degeneracy (1.17) is very very large. Throwing in some numbers, there are 
around 10'° degenerate states per Landau level for electrons in a region of area A = 
1 cm? in a magnetic field B ~ 0.1 T. This large degeneracy ultimately, this leads to 
an array of dramatic and surprising physics. 


1.2.2 Symmetric Gauge 


It is worthwhile to repeat the calculations above using a different gauge choice. This 
will give us a slightly different perspective on the physics. A natural choice is symmetric 
gauge 


1 B 
A= —9x x B= z (792,0) (1.18) 


This choice of gauge breaks translational symmetry in both the x and the y directions. 
However, it does preserve rotational symmetry about the origin. This means that 
angular momentum is now a good quantum number to label states. 


In this gauge, the Hamiltonian is given by 


1 qBy\* qBt\” a a 
H pa EE [pee oe 


Ee E (2? +") (1.19) 


where we’ve introduced the angular momentum operator 


L, = TPy — YPr 


We’ll again restrict to motion in the (x, y)-plane, so we focus on states with k, = 0. 
It turns out that complex variables are particularly well suited to describing states in 
symmetric gauge, in particular in the lowest Landau level with n = 0. We define 


w=xtiy and w=2-ty 
Correspondingly, the complex derivatives are 
o o ~- 1/28 o 
— —i— d ð=- | = +i 
(2 ix] os 2 (2 +i) 


which obey ðw = OW = 1 and OW = Ow = 0. The Hamiltonian, restricted to states 


0 = 


NI = 


with k, = 0, is then given by 


2h2 = 2 
Pas a oi peg as ry 
m 2 
where now 
L, = h(wd — wð) 


It is simple to check that the states in the lowest Landau level take the form 
Yo(w, 5) = f(w)e has 


for any holomorphic function f(w). These all obey 


Hio(w, 0) = PEP olw, 0) 


which is the statement that they lie in the lowest Landau level with n = 0. We can 
further distinguish these states by requiring that they are also eigenvalues of L}. These 
are satisfied by the monomials, 


Yo = whe P/M > Lapo = RMU (1.20) 
for some positive integer M. 


Degeneracy Revisited 


In symmetric gauge, the profiles of the wavefunctions (1.20) form concentric rings 
around the origin. The higher the angular momentum M, the further out the ring. 
This, of course, is very different from the strip-like wavefunctions that we saw in Landau 
gauge (1.15). You shouldn’t read too much into this other than the fact that the profile 
of the wavefunctions is not telling us anything physical as it is not gauge invariant. 


However, it’s worth revisiting the degeneracy of states in symmetric gauge. The 
wavefunction with angular momentum M is peaked on a ring of radius r = v2Mlp. 
This means that in a disc shaped region of area A = mR?, the number of states is 
roughly (the integer part of) 


BA 
N = R?/ [2m = 5 


which agrees with our earlier result (1.17). 


1.2.3 An Invitation to the Quantum Hall Effect 


Take a system with some fixed number of electrons, which are restricted to move in 
the (x, y)-plane. The charge of the electron is q = —e. In the presence of a magnetic 
field, these will first fill up the V = eBA/2z7h states in the n = 0 lowest Landau level. 
If any are left over they will then start to fill up the n = 1 Landau level, and so on. 


Now suppose that we increase the magnetic field B. The number of states M housed 
in each Landau level will increase, leading to a depletion of the higher Landau levels. 
At certain, very special values of B, we will find some number of Landau levels that 
are exactly filled. However, generically there will be a highest Landau level which is 
only partially filled. 


= ]0 = 


R, (hie?) 


Magnetic field (T) 


Figure 3: The integer quantum Hall ef- Figure 4: The fractional quantum Hall 
fect. effect. 


This successive depletion of Landau levels gives rise to a number of striking signatures 
in different physical quantities. Often these quantities oscillate, or jump discontinuously 
as the number of occupied Landau levels varies. One particular example is the de Haas 
van Alphen oscillations seen in the magnetic susceptibility which we describe in Section 
3.3.4. Another example is the behaviour of the resistivity p. This relates the current 
density J = (Jz, Jy) to the applied electric field E = (E,, Ey), 


E = pJ 


In the presence of an applied magnetic field B = (0,0, B), the electrons move in circles. 
This results in components of the current which are both parallel and perpendicular to 
the electric field. This is modelled straightforwardly by taking p to be a matrix 


— Przy Para 


where the form of the matrix follows from rotational invariance. Here pyx is called the 
longitudinal resistivity while pzy is called the Hall resistivity. 


In very clean samples, in strong magnetic fields, both components of the resistivity 
exhibit very surprising behaviour. This is shown in the left-hand figure above. The 
Hall resistivity pz, increases with B by forming a series of plateaux, on which it takes 


values 
2mh 1 
Pry = Pay vEeEN 
The value of v (which is labelled i = 2,3,... in the data shown above) is measured 


to be an integer to extraordinary accuracy — around one part in 10°. Meanwhile, 


=] i] = 


the longitudinal resistivity vanishes when p,, lies on a plateaux, but spikes whenever 
there is a transition between different plateaux. This phenomenon, called the integer 
Quantum Hall Effect, was discovered by Klaus von Klitzing in 1980. For this, he was 
awarded the Nobel prize in 1985. 


It turns out that the integer quantum Hall effect is a direct consequence of the 
existence of discrete Landau levels. The plateaux occur when precisely v € Z* Landau 
levels are filled. Of course, we’re very used to seeing integers arising in quantum 
mechanics — this, after all, is what the “quantum” in quantum mechanics means. 
However, the quantisation of the resistivity psy is something of a surprise because 
this is a macroscopic quantity, involving the collective behaviour of many trillions of 
electrons, swarming through a hot and dirty system. A full understanding of the integer 
quantum Hall effect requires an appreciation of how the mathematics of topology fits 
in with quantum mechanics. David Thouless (and, to some extent, Duncan Haldane) 
were awarded the 2016 Nobel prize for understanding the underlying role of topology 
in this system. 


Subsequently it was realised that similar behaviour also happens when Landau levels 
are partially filled. However, it doesn’t occur for any filling, but only very special 
values. This is referred to as the fractional quantum Hall effect. The data is shown 
in the right-hand figure. You can see clear plateaux when the lowest Landau level has 


v = = of its states filled. There is another plateaux when v = 2 of the states are 


1 
filled, followed by a bewildering pattern of further plateaux, all of which occur when v 
is some rational number. This was discovered by Tsui and Stormer in 1982. It called 
the Fractional Quantum Hall Effect. The 1998 Nobel prize was awarded to Tsui and 
Stormer, together with Laughlin who pioneered the first theoretical ideas to explain 


this behaviour. 


The fractional quantum Hall effect cannot be explained by treating the electrons 
as free. Instead, it requires us to take interactions into account. We have seen that 
each Landau level has a macroscopically large degeneracy. This degeneracy is lifted by 
interactions, resulting in a new form of quantum liquid which exhibits some magical 
properties. For example, in this state of matter the electron — which, of course, 
is an indivisible particle — can split into constituent parts! The v = z state has 
excitations which carry 1/3 of the charge of an electron. In other quantum Hall states, 
the excitations have charge 1/5 or 1/4 of the electron. These particles also have a 
number of other, even stranger properties to do with their quantum statistics and 


there is hope that these may underly the construction of a quantum computer. 


= [P= 


We will not delve into any further details of the quantum Hall effect. Suffice to say 
that it is one of the richest and most beautiful subjects in theoretical physics. You can 
find a fuller exploration of these ideas in the lecture notes devoted to the Quantum 
Hall Effect. 


1.3 The Aharonov-Bohm Effect 


In our course on Electromagnetism, we learned that the gauge potential A, is unphys- 
ical: the physical quantities that affect the motion of a particle are the electric and 
magnetic fields. Yet we’ve seen above that we cannot formulate quantum mechanics 
without introducing the gauge fields A and @. This might lead us to wonder whether 
there is more to life than E and B alone. In this section we will see that things are, 
indeed, somewhat more subtle. 


1.3.1 Particles Moving around a Flux Tube 


Consider the set-up shown in the figure. We have a solenoid B 
of area A, carrying magnetic field B = (0,0, B) and therefore win N 
magnetic flux @ = BA. Outside the solenoid the magnetic ii 
field is zero. However, the vector potential is not. This fol- 
lows from Stokes’ theorem which tells us that the line integral 
outside the solenoid is given by Z 2 
fA-dx= [B-a8=0 
Bec 
This is simply solved in cylindrical polar coordinates by Figure 5: 
® 
Ag = —— 
° dnr 


Now consider a charged quantum particle restricted to lie in a ring of radius r outside the 
solenoid. The only dynamical degree of freedom is the angular coordinate ¢ € [0, 27). 
The Hamiltonian is 
1 ; 1 dQ qe \? 
H= — (p, — gA) = — | ih -E 
2m (pe — GAs) 2mr? ( i Og 2) 

We'd like to see how the presence of this solenoid affects the particle. The energy 
eigenstates are simply 


p= et nez (1.21) 


—13- 


Figure 6: The energy spectrum for a particle moving around a solenoid. 


where the requirement that w is single valued around the circle means that we must 
take n € Z. Plugging this into the time independent Schrodinger equation Hy = Ew, 
we find the spectrum 


1 g®\* RP p \? 
2mr? ( | 2mr? ¢ =| ine 


where we've defined the quantum of flux Po = 27h/q. (Usually this quantum of flux 
is defined using the electron charge q = —e, with the minus signs massaged so that 
o = 2rh/e > 0.) 


Note that if ® is an integer multiple of o, then the spectrum is unaffected by the 
solenoid. But if the flux in the solenoid is not an integral multiple of Po — and there is 
no reason that it should be — then the spectrum gets shifted. We see that the energy 
of the particle knows about the flux ® even though the particle never goes near the 
region with magnetic field. The resulting energy spectrum is shown in Figure 6. 


There is a slightly different way of looking at this result. Away from the solenoid, 
the gauge field is a total divergence 


A=Va with gee 
20 


This means that we can try to remove it by redefining the wavefunction to be 


oa see (=£) TE (So) y 
2th 


h 


However, there is an issue: the wavefunction should be single-valued. This, after all, 
is how we got the quantisation condition n € Z in (1.21). This means that the gauge 
transformation above is allowed only if ® is an integer multiple of ®) = 27h/q. Only 
in this case is the particle unaffected by the solenoid. The obstacle arises from the fact 
that the wavefunction of the particle winds around the solenoid. We see here the first 
glimpses of how topology starts to feed into quantum mechanics. 
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There are a number of further lessons lurking in this simple quantum mechanical 
set-up. You can read about them in the lectures on the Quantum Hall Effect (see 
Section 1.5.3) and the lectures on Gauge Theory (see Section 3.6.1). 


1.3.2 Aharonov-Bohm Scattering 


The fact that a quantum particle can be affected by A 
even when restricted to regions where B = 0 was first 
pointed out by Aharonov and Bohm in a context which 
is closely related to the story above. They revisited the 
famous double-slit experiment, but now with a twist: 
a solenoid carrying flux ® is hidden behind the wall. 


This set-up is shown in the figure below. Once again, 
the particle is forbidden from going near the solenoid. 


Figure 7: 


Nonetheless, the presence of the magnetic flux affects 
the resulting interference pattern, shown as the dotted line in the figure. 


Consider a particle that obeys the free Schrodinger equation, 
1 2 
=—(-ihV - qA) =ni 
2m 


We can formally remove the gauge field by writing 
= id ae 
vix) = exp (F f ace) ax) o) 


where the integral is over any path. Crucially, however, in the double-slit experiment 
there are two paths, P, and Pz. The phase picked up by the particle due to the gauge 
field differs depending on which path is taken. The phase difference is given by 


Ag=2] A-dx-# Avix=2$ a-dx={ [B-ds 
h Jp, h Jp, h h 

Note that neither the phase arising from path P}, nor the phase arising from path P, is 

gauge invariant. However, the difference between the two phases is gauge invariant. As 

we see above, it is given by the flux through the solenoid. This is the Aharonov-Bohm 


iq®/h 


phase, e , an extra contribution that arises when charged particles move around 


magnetic fields. 


The Aharonov-Bohm phase manifests in the interference pattern seen on the screen. 
As Ẹ is changed, the interference pattern shifts, an effect which has been experimentally 
observed. Only when ® is an integer multiple of ®ọ is the particle unaware of the 
presence of the solenoid. 
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1.4 Magnetic Monopoles 


A magnetic monopole is a hypothetical object which emits a radial magnetic field of 
the form 


Z es [es B=s (1.22) 


Arr? 


Here g is called the magnetic charge. 


We learned in our first course on Electromagnetism that magnetic monopoles don’t 
exist. First, and most importantly, they have never been observed. Second there’s a 
law of physics which insists that they can’t exist. This is the Maxwell equation 


V:-B=0 


Third, this particular Maxwell equation would appear to be non-negotiable. This is 
because it follows from the definition of the magnetic field in terms of the gauge field 


B=VxA > V>B=0 


Moreover, as we’ve seen above, the gauge field A is necessary to describe the quantum 
physics of particles moving in magnetic fields. Indeed, the Aharonov-Bohm effect tells 
us that there is non-local information stored in A that can only be detected by particles 
undergoing closed loops. All of this points to the fact that we would be wasting our 
time discussing magnetic monopoles any further. 


Happily, there is a glorious loophole in all of these arguments, first discovered by 
Dirac, and magnetic monopoles play a crucial role in our understanding of the more 
subtle effects in gauge theories. The essence of this loophole is that there is an ambiguity 
in how we define the gauge potentials. In this section, we will see how this arises. 


1.4.1 Dirac Quantisation 


It turns out that not any magnetic charge g is compatible with quantum mechanics. 
Here we present several different arguments for the allowed values of g. 


We start with the simplest and most physical of these arguments. Suppose that a 
particle with charge q moves along some closed path C in the background of some gauge 
potential A(x). Then, upon returning to its initial starting position, the wavefunction 
of the particle picks up a phase 


yoy with a= $ A -dx (1.23) 
C 


This is the Aharonov-Bohm phase described above. 
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Figure 8: Integrating over S... Figure 9: ...or over S’. 


The phase of the wavefunction is not an observable quantity in quantum mechanics. 
However, as we described above, the phase in (1.23) is really a phase difference. We 
could, for example, place a particle in a superposition of two states, one of which stays 
still while the other travels around the loop C. The subsequent interference will depend 


iqa/h 


on the phase e , just like in the Aharonov-Bohm effect. 


Let’s now see what this has to do with magnetic monopoles. We place our particle, 
with electric charge q, in the background of a magnetic monopole with magnetic charge 
g. We keep the magnetic monopole fixed, and let the electric particle undergo some 
journey along a path C. We will ask only that the path C avoids the origin where the 
magnetic monopole is sitting. This is shown in the left-hand panel of the figure. Upon 
returning, the particle picks up a phase e’@°/" with 


a= f A-dx= / Bods 
c S 


where, as shown in the figure, S is the area enclosed by C. Using the fact that Soz B. 
dS = g, if the surface S makes a solid angle 2, this phase can be written as 


Ng 
a = -> 
4r 
However, there’s an ambiguity in this computation. Instead of integrating over S, it 
is equally valid to calculate the phase by integrating over S’, shown in the right-hand 
panel of the figure. The solid angle formed by S” is Q! = 4r — Q. The phase is then 
given by 
1 (4r — Q)g 
4r 


where the overall minus sign comes because the surface S’ has the opposite orientation 
to S. As we mentioned above, the phase shift that we get in these calculations is 


MES 


observable: we can’t tolerate different answers from different calculations. This means 
that we must have e%/" = ¢’%'/h_ This gives the condition 


qg=2rhn withneZ (1.24) 


This is the famous Dirac quantisation condition. The smallest such magnetic charge 
has n = 1. It coincides with the quantum of flux, g = ®) = 27h/q. 


Above we worked with a single particle of charge g. Obviously, the same argument 
must hold for any other particle of charge q’. There are two possibilities. The first is 
that all particles carry charge that is an integer multiple of some smallest unit. In this 
case, it’s sufficient to impose the Dirac quantisation condition (1.24) where q is the 
smallest unit of charge. For example, in our world we should take q = +e to be the 
electron or proton charge (or, if we look more closely in the Standard Model, we might 
choose to take q = —e/3, the charge of the down quark). 


The second possibility is that the particles carry electric charges which are irrational 
multiples of each other. For example, there may be a particle with charge q and another 
particle with charge v2q. In this case, no magnetic monopoles are allowed. 


It’s sometimes said that the existence of a magnetic monopole would imply the 
quantisation of electric charges. This, however, has it backwards. (It also misses the 
point that we have a wonderful explanation of the quantisation of charges from the 
story of anomaly cancellation in the Standard Model.) There are two possible groups 
that could underly gauge transformations in electromagnetism. The first is U(1); this 
has integer valued charges and admits magnetic monopoles. The second possibility is 
R; this has irrational electric charges and forbids monopoles. All the evidence in our 
world points to the fact that electromagnetism is governed by U(1) and that magnetic 
monopoles should exist. 


Above we looked at an electrically charged particle moving in the background of 
a magnetically charged particle. It is simple to generalise the discussion to particles 
that carry both electric and magnetic charges. These are called dyons. For two dyons, 
with charges (q1, 91) and (qo, g2), the generalisation of the Dirac quantisation condition 
requires 


ng2 — gag © TAZ 


This is sometimes called the Dirac-Zwanziger condition. 
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1.4.2 A Patchwork of Gauge Fields 


The discussion above shows how quantum mechanics constrains the allowed values of 
magnetic charge. It did not, however, address the main obstacle to constructing a 
magnetic monopole out of gauge fields A when the condition B = V x A would seem 
to explicitly forbid such objects. 


Let’s see how to do this. Our goal is to write down a configuration of gauge fields 
which give rise to the magnetic field (1.22) of a monopole which we will place at the 
origin. However, we will need to be careful about what we want such a gauge field to 
look like. 


The first point is that we won’t insist that the gauge field is well defined at the origin. 
After all, the gauge fields arising from an electron are not well defined at the position of 
an electron and it would be churlish to require more from a monopole. This fact gives 
us our first bit of leeway, because now we need to write down gauge fields on R*/{0}, 
as opposed to R? and the space with a point cut out enjoys some non-trivial topology 
that we will make use of. 


Consider the following gauge connection, written in spherical polar coordinates 


g 1-—cosé 
AN = 1.25 
¢ Arr sinb ( ) 
The resulting magnetic field is 
1 o 10 ~ 
B= A= — (AÑ sin 6) # — —_—(rA®)0 
ke rsin gl panee a $) 
Substituting in (1.25) gives 
gr 
= 1.26 
Arr? ( ) 


In other words, this gauge field results in the magnetic monopole. But how is this 
possible? Didn’t we learn in kindergarten that if we can write B = V x A then 
f dS -B = 0? How does the gauge potential (1.25) manage to avoid this conclusion? 


The answer is that A in (1.25) is actually a singular gauge connection. It’s not just 
singular at the origin, where we’ve agreed this is allowed, but it is singular along an 
entire half-line that extends from the origin to infinity. This is due to the 1/ sin 0 term 
which diverges at 0 = 0 and 6 = m. However, the numerator 1 — cos 0 has a zero when 
0 = 0 and the gauge connection is fine there. But the singularity along the half-line 
0 = 7 remains. The upshot is that this gauge connection is not acceptable along the 
line of the south pole, but is fine elsewhere. This is what the superscript N is there to 
remind us: we can work with this gauge connection s long as we keep north. 
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Now consider a different gauge connection 


g 1+ cosé 
Arr sind 


Ag = (1.27) 
This again gives rise to the magnetic field (1.26). This time it is well behaved at 0 = 7, 
but singular at the north pole 0 = 0. The superscript S is there to remind us that this 
connection is fine as long as we keep south. 


At this point, we make use of the ambiguity in the gauge connection. We are going 
to take AÙ in the northern hemisphere and A® in the southern hemisphere. This is 
allowed because the two gauge potentials are the same up to a gauge transformation, 
A —> A + Va. Recalling the expression for Va in spherical polars, we find that for 
6 # 0,7, we can indeed relate Ag and AS by a gauge transformation, 
go 


pa where a = — (1.28) 


AN = AS 
$ $ T On 


1 
rsin 
However, there’s still a question remaining: is this gauge transformation allowed? The 
problem is that the function a is not single valued: a(¢ = 27) = a(@ = 0) + g. And 


this should concern us because, as we’ve seen in (1.8), the gauge transformation also 
acts on the wavefunction of a quantum particle 


w = ele hay, 


There’s no reason that we should require the gauge transformation a to be single- 
valued, but we do want the wavefunction w~ to be single-valued. This holds for the 
gauge transformation (1.28) provided that we have 


qg =2rhn withne Z 


This, of course, is the Dirac quantisation condition (1.24). 


Mathematically, we have constructed of a topologically non-trivial U(1) bundle over 
the S? surrounding the origin. In this context, the integer n is called the first Chern 
number. 


1.4.3 Monopoles and Angular Momentum 


Here we provide yet another derivation of the Dirac quantisation condition, this time 
due to Saha. The key idea is that the quantisation of magnetic charge actually follows 
from the more familiar quantisation of angular momentum. The twist is that, in the 
presence of a magnetic monopole, angular momentum isn’t quite what you thought. 
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To set the scene, let’s go back to the Lorentz force law 


with p = mx. Recall from our discussion in Section 1.1.1 that p defined here is not 
the canonical momentum, a fact which is hiding in the background in the following 
derivation. Now let’s consider this equation in the presence of a magnetic monopole, 
with 


The monopole has rotational symmetry so we would expect that the angular momen- 
tum, x x p, is conserved. Let’s check: 


d 
AXP) L gx pt xx pax x p= ax x (k x B) 


= foxx exx) =H (F-3) 


Arr? 
e 
= — | =f 
dt \4r 
We see that in the presence of a magnetic monopole, the naive 
angular momentum x x p is not conserved! However, as we also 
noticed in the lectures on Classical Dynamics (see Section 4.3.2), 


we can easily write down a modified angular momentum that is 
conserved, namely 
qg ~ 


Eo GD F 
Tv 


The extra term can be thought of as the angular momentum stored 


Figure 10: 


in E x B. The surprise is that the system has angular momentum 
even when the particle doesn’t move. 


Before we move on, there’s a nice and quick corollary that we can draw from this. 
The angular momentum vector L does not change with time. But the angle that the 
particle makes with this vector is 


L-tr= -I constant 
AT 


This means that the particle moves on a cone, with axis L and angle cos 0 = —qg/4r L. 


SiN = 


So far, our discussion has been classical. Now we invoke some simple quantum 
mechanics: the angular momentum should be quantised. In particular, the angular 
momentum in the z-direction should be L, € ZAZ. Using the result above, we have 


qg 


1 
=-ħn => qg=2rħn wthn € Z 
Ar 2 


Once again, we find the Dirac quantisation condition. 


1.5 Spin in a Magnetic Field 


As we’ve seen in previous courses, particles often carry an intrinsic angular momentum 
called spin S. This spin is quantised in half-integer units. For examples, electrons have 
spin 5 and their spin operator is written in terms of the Pauli matrices o, 


h 
S=- 
57 
Importantly, the spin of any particle couples to a background magnetic field B. The 
key idea here is that the intrinsic spin acts like a magnetic moment m which couples 
to the magnetic field through the Hamiltonian 


H=-m-B 


The question we would like to answer is: what magnetic moment m should we associate 


with spin? 
A full answer to this question would require an ex- “fee A 
tended detour into the Dirac equation. Here we pro- a eee Se 


vide only some basic motivation. First consider a par- 
ticle of charge q moving with velocity v around a circle = 22000 
of radius r as shown in the figure. From our lectures on Figure 11: 
Electromagnetism, we know that the associated magnetic 


moment is given by 


ms- rya i 
2 2m 
where L = mr x v is the orbital angular momentum of the particle. Indeed, we already 
saw the resulting coupling H = —(q/2m)L - B in our derivation of the Hamiltonian in 


symmetric gauge (1.19). 
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Since the spin of a particle is another contribution to the angular momentum, we 
might anticipate that the associated magnetic moment takes the form 


q 
=g—S 
a Jom 
where g is some dimensionless number. (Note: g is unrelated to the magnetic charge 
that we discussed in the previous section!) This, it turns out, is the right answer. 
However, the value of g depends on the particle under consideration. The upshot is 
that we should include a term in the Hamiltonian of the form 


q 
H = —g——S -B 1.29 
Ue (1.29) 


The g-factor 


For fundamental particles with spin i — such as the electron — there is a long and 
interesting history associated to determining the value of g. For the electron, this was 
first measured experimentally to be 


Je = 2 


Soon afterwards, Dirac wrote down his famous relativistic equation for the electron. 
One of its first successes was the theoretical prediction ge = 2 for any spin i particle. 
This means, for example, that the neutrinos and quarks also have g = 2. 


This, however, was not the end of the story. With the development of quantum field 
theory, it was realised that there are corrections to the value ge = 2. These can be 
calculated and take the form of a series expansion, starting with 


ge =2 (1+ = +...) = 2.00232 
2r 


where a = e?/4reoħc ~ 1/137 is the dimensionless fine structure constant which char- 
acterises the strength of the Coulomb force. The most accurate experimental measure- 
ment of the electron magnetic moment now yields the result 


ge ~ 2.00231930436182 + 2.6 x 107"? 


Theoretical calculations agree to the first ten significant figures or so. This is the most 
impressive agreement between theory and experiment in all of science! Beyond that, 
the value of œ is not known accurately enough to make a comparison. Indeed, now 
the measurement of the electron magnetic moment is used to define the fine structure 
constant a. 


ee 


While all fundamental spin z particles have g œ~ 2, this does not hold for more 


complicated objects. For example, the proton has 
Jp ~ 5.588 


while the neutron — which of course, is a neutral particle, but still carries a magnetic 
moment — has 


In ~ —3.823 


where, because the neutron is neutral, the charge q = e is used in the formula (1.29). 
These measurements were one of the early hints that the proton and neutron are com- 
posite objects. 


1.5.1 Spin Precession 


Consider a constant magnetic field B = (0,0, B). We would like to understand how 
this affects the spin of an electron. We’ll take ge = 2. We write the electric charge of 


the electron as q = —e so the Hamiltonian is 
h 
H= eB 
2m 


The eigenstates are simply the spin-up |f} and spin-down ||) states in the z-direction. 
They have energies 


hw hw 
Alt)=—"It) and HI) =- 14) 


where wg = eB/m is the cyclotron frequency which appears throughout this chapter. 


What happens if we do not sit in an energy eigenstate. A 
general spin state can be expressed in spherical polar coordinates 
as 


|2(8, 6) = cos(8/2)|T) + e° sin(6/2)|1) 


As a check, note that |W(0 = 7/2, @)) is an eigenstate of o” when 
@ = 0,7 and an eigenstate of o” when ọ = 7/2,37/2 as it 


Figure 12: 


should be. The evolution of this state is determined by the time- 
dependent Schrodinger equation 


Ab) _ 
1 >, = HW) 
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which is easily solved to give 
CO, 654) = e*t] cos(/2)|+) + ef? sin(/2)|L) 


We see that the effect of the magnetic field is to cause the spin to precess about the B 
axis, as shown in the figure. 


1.5.2 A First Look at the Zeeman Effect 


The Zeeman effect describes the splitting of atomic energy levels in the presence of a 
magnetic field. Consider, for example, the hydrogen atom with Hamiltonian 


i Le 
H = -—V? —- —_—— 
2m ATE r 
The energy levels are given by 
ame 1 


where a is the fine structure constant. Each energy level has a degeneracy of states. 
These are labelled by the angular momentum / = 0,1,...,n — 1 and the z-component 
of angular momentum m; = —l,..., +l. Furthermore, each electron carries one of two 
spin states labelled by m, = +4. This results in a degeneracy given by 


n—1 
Degeneracy = 2 X (2141) = 2n? 
1=0 


Now we add a magnetic field B = (0,0, B). As we have seen, this results in perturbation 
to the Hamiltonian which, to leading order in B, is given by 


AH =; (L+ g.S)-B 


e 
m 

In the presence of such a magnetic field, the degeneracy of the states is split. The 

energy levels now depend on the quantum numbers n, m; and m, and are given by 


le = En a (mu + 2m.) B 


The Zeeman effect is developed further in the Lectures on Topics in Quantum Mechanics. 
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2. Band Structure 


In this chapter, we start our journey into the world of condensed matter physics. This 
is the study of the properties of “stuff’. Here, our interest lies in a particular and 
familiar kind of stuff: solids. 


Solids are collections of tightly bound atoms. For most solids, these atoms arrange 
themselves in regular patterns on an underlying crystalline lattice. Some of the elec- 
trons of the atom then disassociate themselves from their parent atom and wander 
through the lattice environment. The properties of these electrons determine many of 
the properties of the solid, not least its ability to conduct electricity. 


One might imagine that the electrons in a solid move in a fairly random fashion, as 
they bounce from one lattice site to another, like a ball in a pinball machine. However, 
as we will see, this is not at all the case: the more fluid nature of quantum particles 
allows them to glide through a regular lattice, almost unimpeded, with a distorted 
energy spectrum the only memory of the underlying lattice. 


In this chapter, we will focus on understanding how the energy of an electron depends 
on its momentum when it moves in a lattice environment. The usual formula for kinetic 
energy, E = smu" = p”/2m, is one of the first things we learn in theoretical physics as 
children. As we will see, a lattice changes this in interesting ways, the consequences of 
which we will explore in chapter 3. 


2.1 Electrons Moving in One Dimension 


We begin with some particularly simple toy models which capture much of the relevant 
physics. These toy models describe an electron moving in a one-dimensional lattice. 
We’ll take what lessons we can from this before moving onto more realistic descriptions 
of electrons moving in higher dimensions. 


2.1.1 The Tight-Binding Model 


The tight-binding model is a caricature of electron motion in solid in which space is 
made discrete. The electron can sit only on the locations of atoms in the solid and has 
some small probability to hop to a neighbouring site due to quantum tunnelling. 


To start with our “solid” consists of a one-dimensional lattice of atoms. This is 
described by N points arranged along a line, each separated by distance a. 
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Consider a single electron moving on this lattice. We will assume that the electron 
can only sit on a given lattice point; it’s not allowed to roam between lattice points. 
This is supposed to mimic the idea that electrons are bound to the atoms in a lattice 
and goes by the name of the tight-binding approximation. (We’ll see exactly what we’re 
neglecting in this approximation later.) 


When the electron sits on the n'è atom, we denote the quantum state as |n}. These 
states are considered orthogonal to each other, so 


(n|m) = Onm 


Clearly the total Hilbert space has dimension N, and is spanned by |n) with n = 
Totta Ne 


What kind of Hamiltonian will govern the dynamics of this electron? If the electron 
just remains on a given atom, an appropriate Hamiltonian would be 


Ho = Eo X |n) (nl 


Each of the position states |n) is an energy eigenstate of Hp with energy Eo. The 
electrons governed by this Hamiltonian don’t move. This Hamiltonian is boring. 


To make things more interesting, we need to include the possibility that the electron 
can tunnel from one site to another. How to do this? Well, the Hamiltonian governs 
time evolution. In some small time increment of time At, a state evolves as 

iAt 
jb) > |v) — SAW) + O(a) 
This means that if we want the possibility for the electron to hop from one site to 
another, we should include in the Hamiltonian a term of the form |m)(n| which takes 
an electron at site n and moves it to an electron at site m. 


There is one last ingredient that we want to feed into our model: locality. We don’t 
want electrons to disappear and reappear many thousands of lattice spacings down the 
line. We want our model to describe electrons hopping from one atom to neighbouring 
atoms. This motivates our final form of the Hamiltonian, 


H = Ey >> |n)(n| -tY (In) (n+ 1] + In + 1) (nl) (2.1) 


— 27 = 


First a comment on notation: the parameter t is called the hopping parameter. It is not 
time; it is simply a number which determines the probability that a particle will hop 
to a neighbouring site. (More precisely, the ratio t?/E? will determine the probability. 
of hopping.) It’s annoying notation, but unfortunately t is the canonical name for this 
hopping parameter so it’s best we get used to it now. 


Now back to the physics encoded in H. We’ve chosen a Hamiltonian that only 
includes hopping terms between neighbouring sites. This is the simplest choice; we will 
describe more general choices later. Moreover, the probability of hopping to the left is 
the same as the probability of hopping to the right. This is required because H must 
be a Hermitian operator. 


There’s one final issue that we have to address before solving for the spectrum of H: 
what happens at the edges? Again, there are a number of different possibilities but 
none of the choices affect the physics that we’re interested in here. The simplest option 
is simply to declare that the lattice is periodic. This is best achieved by introducing a 
new state |N + 1), which sits to the right of |N}, and is identified with |N + 1) = |1). 


Solving the Tight-Binding Model 


Let’s now solve for the energy eigenstates of the Hamiltonian (2.1). A general state 
can be expanded as 


Ib) = do dl) 
with Yn E€ C. Substituting this into the Schrodinger equation gives 


Hh) = EW) = EoŅ Yml) -tà X dari) + balm + 1)) = EY mlm) 


If we now take the overlap with a given state (n|, we get the set of linear equations for 
the coefficients Yn 


(n|H |p) = Eln) => Bota — t(ngi + Vn-1) = Edn (2.2) 


These kind of equations arise fairly often in physics. (Indeed, they will arise again in 
Section 4 when we come to discuss the vibrations of a lattice.) They are solved by the 
ansatz 


Wn Z eikna (2.3) 


Or, if we want to ensure that the wavefunction is normalised, Y, = e’*"*/ VN. The 
exponent k is called the wavenumber. The quantity p = hk plays a role similar to 
momentum in our discrete model; we will discuss the ways in which it is like momentum 
in Section 2.1.4. We’ll also often be lazy and refer to k as momentum. 
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The wavenumber has a number of properties. First, the set of solutions remain the 
same if we shift k —> k + 2r /a so the wavenumber takes values in 


ke |-=,+=) (2.4) 


This range of k is given the fancy name Brillouin zone. We’ll see why this is a useful 
concept that deserves its own name in Section 2.2. 


There is also a condition on the allowed values of k coming from the requirement of 
periodicity. We want Wy4, = Y1, which means that e'N% = 1. This requires that k 
is quantised in units of 27/aN. In other words, within the Brillouin zone (2.4) there 
are exactly N quantum states of the form (2.3). But that’s what we expect as it’s the 
dimension of our Hilbert space; the states (2.3) form a different basis. 


States of the form (2.3) have the property that Z 


nai = ecient 3L 


This immediately ensures that equation (2.2) is 
solved for any value of k, with the energy eigen- in 
value 


E = Ey — 2tcos(ka) (2.5) i i i k 


The spectrum is shown in the figure for t > 0. Figure 1s: 


(The plot was made with a = t = 1 and Ep = 2.) The states with k > 0 describe 
electrons which move to the right; those with k < 0 describe electrons moving to the 
left. 


There is a wealth of physics hiding in this simple result, and much of the following 
sections will be fleshing out these ideas. Here we highlight a few pertinent points 


e The electrons do not like to sit still. The eigenstates |n) of the original Hamil- 
tonian Ho were localised in space. One might naively think that adding a tiny 
hopping parameter t would result in eigenstates that were spread over a few sites. 
But this is wrong. Instead, all energy eigenstates are spread throughout the whole 
lattice. Arbitrarily small local interactions result in completely delocalised energy 


eigenstates. 


e The energy eigenstates of Ho were completely degenerate. Adding the hopping 
term lifts this degeneracy. Instead, the eigenstates are labelled by the wavevector 
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k and have energies (2.5) that lie in a range E(k) € [Eo — 2t, Eo + 2t]. This 
range of energies is referred to a band and the difference between the maximum 
and minimum energy (which is 4¢ in this case) is called the band width. In our 
simple model, we have just a single energy band. In subsequent models, we will 
see multiple bands emerging. 


For suitably small momentum, k < 7/a, we can Taylor expand the energy (2.5) 
as 


E(k) ~ (Eo — 2t) + ta? k? 


Up to a constant, this takes the same form as a free particle moving in the 
continuum, 


Ere = 2. 
f = (2.6) 


This is telling us that low energy, low momentum particles are unaware that they 
are moving on an underlying lattice. Instead, they act as if they are moving along 
a continuous line with effective mass m* = h?/2ta?. Notice that in this model 
the effective mass has nothing to do with the physical mass of the electron; it is 
inherited from properties of the lattice. 


There is a cute reciprocity between the properties of momentum and position. 
We know from our first course on quantum mechanics that if space is made finite 
— for example, a particle in a box, or a particle moving on a circle — then 
momentum becomes discrete. We also saw this above as the periodic boundary 
conditions enforced the wavenumber to be quantised in units of 27/Na. 


However, our tight-binding model also exhibits the converse phenomenon: when 
we make space discrete, momentum becomes periodic: it has to lie in the Brillouin 
zone (2.4). More generally, discreteness is the Fourier transform of compactness. 


A First Look at Metals and Insulators 


There’s further physics to uncover if we consider more than one electron moving in 


the lattice. This section is just to give a flavour of these ideas; we will discuss them 


in more detail in Section 3.1. For simplicity, we will assume that the electrons do not 


interact with each other. Now the state of the system is governed by the Pauli exclusion 


principle: two electrons are not allowed to occupy the same state. 
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As we have seen, our tight-binding model contains N states. However, each electron 
has two internal states, spin |f} and spin ||). This means that, in total, each electron 
can be in one of 2N different states. Invoking the Pauli exclusion principle, we see that 
our tight-binding model makes sense as long as the number of electrons is less than or 
equal to 2N. 


The Pauli exclusion principle means that the ground state of a multi-electron system 
has interesting properties. The first two electrons that we put in the system can both 
sit in the lowest energy state with k = 0 as long as they have opposite spins. The next 
electron that we put in finds these states occupied; it must sit in the next available 
energy state which has k = +27/Na. And so this continues, with subsequent electrons 
sitting in the lowest energy states which have not previously been occupied. The net 
result is that the electrons fill all states up to some final kp which is known as the Fermi 
momentum. The boundary between the occupied and unoccupied states in known as 
the Fermi surface. Note that it is a surface in momentum space, rather than in real 
space. We will describe this in more detail in Section 3.1. (See also the lectures on 
Statistical Physics.) 


How many electrons exist in a real material? Here something nice happens, because 
the electrons which are hopping around the lattice come from the atoms themselves. 
One sometimes talks about each atom “donating” an electron. Following our chemist 
friends, these are called valence electrons. Given that our lattice contains N atoms, 
it’s most natural to talk about the situation where the system contains ZN electrons, 
with Z an integer. The atom is said to have valency Z. 


Suppose Z = 1, so we have N electrons. Then only te 
half of the states are filled and kp = 1/2a. This is 
shown in the figure. Note that there are as many 
electrons moving to the left (with k < 0) as there af 
are electrons moving to the right (k > 0). This is it 
the statement that there is no current in the ground 


state of the system. fe ee 


We can now ask: what are the low-energy excita- Figure 14: 


tions of the system? We see that there are many: we 

can take any electron just below the Fermi surface and promote it to an electron just 
above the Fermi surface at a relatively small cost in energy. This becomes particularly 
relevant if we perturb the system slightly. For example, we could ask: what happens 
if we apply an electric field? As we will describe in more detail in 3.1.1, the ground 
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state of the system re-arranges itself at just a small cost of energy: some left-moving 
states below the Fermi surface become unoccupied, while right-moving states above the 
Fermi surface become occupied. Now, however, there are more electrons with k > 0 
than with k < 0. This results in an electrical current. What we have just described is 
a conductor. 


Let’s contrast this with what happens when we have ia 
2N electrons in the system. Now we don’t get any 
choice about how to occupy states since all are occu- 
pied. Said another way, the multi-particle Hilbert 
space contains just a single state: the fully filled it 
band. This time, if we perturb with an electric field 


then the electrons can’t move anywhere, simply be- = eae See aa i a 
cause there’s no where for them to go: they are locked 
in place by the Pauli principle. This means that, de- 
spite the presence of the electric field, there is no electric current. This is what we call 
an insulator. (It is sometimes said to be a band insulator to distinguish it from other 


Figure 15: 


mechanisms that also lead to insulating behaviour.) 


The difference between a conductor and an insulator is one of the most striking 
characterisations of materials, one that we all learn in high school. The rough sketch 
above is telling us that this distinction arises due to quantum phenomena: the formation 
of energy bands and the Pauli exclusion principle. We’ll explore this more in Section 
asl 


2.1.2 Nearly Free Electrons 


The tight-binding model is an extreme cartoon of the real physics in which space is 
discrete; electrons are stuck on atomic sites with a non-vanishing probability to hop 
to a neighbouring site. In this section we present another cartoon that is designed to 
capture the opposite extreme. 


We will assume that our electron is free to move anywhere along the line, parame- 
terised by the position x. To mimic the underlying lattice, we add a weak, periodic 
potential V(x). This means that we consider the Hamiltonian 


p? 
H = +— + V(x) 
2m 
where p = —ihd/dz is the usual momentum operator. The periodicity of the potential 
means that it satisfies 


V(zx+a)= V(x) (2.7) 
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V(x) V(x) 


Figure 16: A periodic sine wave. Figure 17: A periodic square wave. 


For example, the potential could take the form of a sine wave, or a square wave as 
shown in the figure, or it could be a an infinite series of delta functions. For much of 
our discussion we won’t need the exact form of the potential. 


To avoid discussing edge effects, it’s again useful to consider the particle moving 
on a circle St of length (circumference) L. This is compatible with the periodicity 
requirement (2.7) only if L/a = N € Z. The integer N plays the role of the number of 
atoms in the lattice. 


In the absence of the potential, the eigenstates are the familiar plane waves |k), 
labelled by the momentum p = hk. Because we are on a circle, the wavenumber of k is 
quantised in units of 27/Z. The associated wavefunctions are 


belt) = (2|k) = Ta (2.8) 


These states are are orthonormal, with 
nl tie he 
(klk) = a dx e = Õk,k' (2.9) 


(Recall that we are living on a circle, so the momenta k are discrete and the Kronecker 
delta is the appropriate thing to put on the right-hand side.) Meanwhile, the energy 
of a free particle is given by 
hek? 
E(k) = (2.10) 


2m 


Our goal is to understand how the presence of the potential V(x) affects this energy 
spectrum. To do this, we work perturbatively. However, perturbation theory in the 
present situation is a little more subtle than usual. Let’s see why. 
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Perturbation Theory 


Recall that the first thing we usually do in perturbation theory is decide whether 
we have non-degenerate or degenerate energy eigenstates. Which do we have in the 
present case? Well, all states are trivially degenerate because the energy of a free 
particle moving to the right is the same as the energy of a free particle moving to the 
left: Eo(k) = Eo(—k). But the fact that the two states |k} and |—k) have the same 
energy does not necessarily mean that we have to use degenerate perturbation theory. 
This is only true if the perturbation causes the two states to mix. 


To see what happens we will need to compute matrix elements (k|V |k’). The key bit 
of physics is the statement that the potential is periodic (2.7). This ensures that it can 
be Fourier expanded 


= e°"" with Wav, 
neZ 


where the Fourier coefficients follow from the inverse transformation 


a i 
V, = F dx V(x) eo 2ting/a 


a Jo 


The matrix elements are then given by 


(k|V |k’) =z fa > ae OS yo eee (2.11) 


neZ neZ 
We see that we get mixing only when 


27M 
k = ki + — 
for some integer n. In particular, we get mixing between degenerate states |k} and |—k) 
only when 
ka 
a 
for some n. The first time that this happens is when k = z/a. But we’ve seen this 
value of momentum before: it is the edge of the Brillouin zone (2.4). This is the first 
hint that the tight-binding model and nearly free electron model share some common 
features. 


With this background, let’s now try to sketch the basic features of the energy spec- 
trum as a function of k. 
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Low Momentum: With low momentum |k| < 7/a, there is no mixing between states 


at leading order in perturbation theory (and very little mixing at higher order). In 
this regime we can use our standard results from non-degenerate perturbation theory. 
Expanding the energy to second order, we have 


_ “ + (k|V|k) + 32 ee ae (2.12) 


k'£k 


E(k) 


From (2.11), we know that the first order correction is (k|V|k) = Vo, and so just 
gives a constant shift to the energy, independent of k. Meanwhile, the second order 
term only gets contributions from |k’) = |k+2an/a) for some n. When |k| < z/a, 
these corrections are small. We learn that, for small momenta, the particle moves as if 
unaffected by the potential. Intuitively, the de Broglie wavelength 27/k of the particle 
much greater than the wavelength a of the potential, and the particle just glides over 
it unimpeded. 


The formula (2.12) holds for low momenta. It also holds for momenta mn/a < 
k < m(n + 1)/a which are far from the special points where mixing occurs. However, 
the formula knows about its own failings because if we attempt to use it when k = 
n/a for some n, the the numerator (k|V|—&) is finite while the denominator becomes 
zero. Whenever perturbation theory diverges in this manner it’s because we’re doing 
something wrong. In this case it’s because we should be working with degenerate 
perturbation theory. 


At the Edge of the Brillouin Zone: Let’s consider the momentum eigenstates which 


sit right at the edge of the Brillouin zone, k = a /a, or at integer multiples 


k= — 
a 
As we’ve seen, these are the values which mix due to the potential perturbation and 
we must work with degenerate perturbation theory. 


Let’s recall the basics of degenerate perturbation theory. We focus on the subsector of 
the Hilbert space formed by the two degenerate states, in our case |k} and |k") = |—k). 
To leading order in perturbation theory, the new energy eigenstates will be some linear 
combination of these original states alk) + 6|k’). We would like to figure out what 
choice of a and £ will diagonalise the new Hamiltonian. There will be two such choices 
since there must, at the end of the day, remain two energy eigenstates. To determine 
the correct choice of these coefficients, we write the Schrodinger equation, restricted to 
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this subsector, in matrix form 


(kIHIk) (EHIK) \ fa a 
=E (2.13) 
(k’|E|k) (k'|H|k } \ 8 B 
We’ve computed the individual matrix elements above: using the fact that the states 


|k} are orthonormal (2.9), the unperturbed energy (2.10) and the potential matrix 
elements (2.11), our eigenvalue equation becomes 


Eo(k) + Vo Va a\_,7{¢ (2.14) 
Vz Eg(k’') +o) \8 b 


where, for the value k = —k’ = nr /a of interest, Eo(k) = Eo(k’) = n?R? r? /2ma?. It’s 
simple to determine the eigenvalues Æ of this matrix: they are given by the roots of 
the quadratic equation 

K nr? 


E =f) |V = pe. 
(Eo(k) + Vo = E} —Val?=0 > ee 


+Vo+|V,| (2.15) 


This is important. We see that a gap opens up in the spectrum at the values k = +n7/a. 
The size of the gap is proportional to 2|V,,|. 


It’s simple to understand what’s going on here. Consider the simple potential 


2 
V = 2V; cos (=) 


which gives rise to a gap only at k = +r/a. The eigenvectors of the matrix are 
(a, 6) = (1,—1) and (a, 8) = (1,1), corresponding to the wavefunctions 


balz) = (al (Ik) +|-k)) ~ cos (=) 
w_(x) = (x (Ik) - |-k)) SRD = 
dee 


The density of electrons is proportional to |W~|°. Plotting these densities on top of the 


potential, we see that w+ describes electrons that are gathered around the peaks of the 
potential, while ~_ describes electrons gathered around the minima. It is no surprise 
that the energy of w is higher than that of qw_. 
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Close to the Edge of the Brillouin Zone: Now consider an electron with 


pa ae 
a 


for some small ô. As we’ve seen, the potential causes plane wave states to mix only if 
their wavenumbers differ by some multiple of 27/a. This means that |k} = |na/a + ô) 
will mix with |k} = |-nr/a+ ô}. These states don’t quite have the same kinetic 
energy, but they have very nearly the same kinetic energy. And, as we will see, the 
perturbation due to the potential V will mean that these states still mix strongly. 


To see this mixing, we need once again to solve the eigenvalue equation (2.13) or, 
equivalently, (2.14). The eigenvalues are given by solutions to the quadratic equation 


(Ho(k) + Vo - E) (Eok) + Vo - E) —|V,)2 =0 (2.16) 


The only difference from our previous discussion is that E(k) and E(k’) are now given 
by 


ee (= +6) end. HOS i — -6) 


Om 


and the quadratic equation (2.16) becomes 
E {wr 2 (R mr? 
uan LETE ey E a a T, 
2m | a 2m a 
This equation has two solutions, EF = E4, given by 
Efun h2 2nrô\ 
ma | yr a (—— 
2m \ a? 2m a 


We’re ultimately interested in this expression when ô is small, where we anticipate that 
the effect of mixing will be important. But, as a sanity check, let’s first expand it in 


the opposite regime, when we’re far from the edge of the Brillouin zone and ô is large 
compared to the gap Vp. In this case, a little bit of algebra shows that the eigenvalues 
can be written as 

Vial? 


Ex = Eolnr/a + ô) + Vo Donato- aaas) 


But this coincides with the the expression that we got from second-order, non-degenerate 
perturbation theory (2.12). (Or, more precisely, because we have kept just a single mix- 
ing term in our discussion above we get just a single term in the sum in (2.12); for some 
choice of potentials, keeping further terms may be important.) 
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Figure 18: Energy dispersion for the free electron model. 


Our real interest is what happens close to the edge of the Brillouin zone when ô is 
small compared to the gap Vp. In this case we can expand the square-root to give 
_ Pnr? he ( 1 a) j 


E & Vo |Van + — | 1 — 
T m a Me rong |V,,| ma? 


The first collection of terms coincide with the energy at the edge of the Brillouin zone 
(2.15), as indeed it must. For us, the important new point is in the second term which 
tells us that as we approach the gaps, the energy is quadratic in the momentum 0. 


Band Structure 
We now have all we need to sketch the rough form of the energy spectrum E(k). The 
original quadratic spectrum is deformed with a number of striking features: 


e For small momenta, k < T/a, the spectrum remains roughly unchanged. 


e The energy spectrum splits into distinct bands, with gaps arising at k = n/a 
with n € Z. The size of these gaps is given by 2|V,,|, where V,, is the appropriate 
Fourier mode of the potential. 


The region of momentum space corresponding to the nt? energy band is called 
the nt? Brillouin zone. However, we usually call the 1* Brillouin zone simply the 
Brillouin zone. 
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e As we approach the edge of a band, the spectrum is quadratic. In particular, 
dE/dk — 0 at the end of a band. 


The relationship E(k) between energy and momentum is usually called the dispersion 
relation. In the present case, it is best summarised in a figure. 


Note that the spectrum within the first Brillouin zone |k| < a/a, looks very similar to 
what we saw in the tight-binding model . The qualitative differences in the two models 
arise because the tight-binding model has a finite number of states, all contained in 
the first Brillouin zone, while the nearly-free electron model has an infinite number of 
states which continue for |k| > w/a. 


2.1.3 The Floquet Matrix 


One of the main lessons that we learned above is that there are gaps in the energy 
spectrum. It’s hard to overstate the importance of these gaps. Indeed, as we saw 
briefly above, and will describe in more detail in 3.1.1, the gaps are responsible for 
some of the most prominent properties of materials, such as the distinction between 
conductors and insulators. 


Because of the important role they play, we will here describe another way to see the 
emergence of gaps in the spectrum that does not rely on perturbation theory. Consider 
a general, periodic potential V(x) = V(x +a). We are interested in solutions to the 
Schrodinger equation 


5 SE + V(n)0(x) = Eyle) (2.17) 


Since this is a second order differential equation, we know that there must be two 
solutions w(x) and w(x). However, because the potential is periodic, it must be the 
case that yı(x + a) and (x + a) are also solutions. These two sets of solutions are 
therefore related by some linear transformation 


r+ a ba 

2(a + a) Yaz) 
where F(E) is a 2 x 2 matrix which, as the notation suggests, depends on the energy of 
the solution E. It is known as the Floquet matriz and has a number of nice properties. 
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Claim: det(F) = 1. 


Proof: First some gymnastics. We differentiate (2.18) to get 


(ae sH (40) 
vale + a) v(a) 


We can combine this with our previous equation by introducing the 2 x 2 matrix 


wo- (AO n) 
vale) yala) 


which obeys the matrix equation 
W(x +a) = F(E)W (x) (2.19) 


Consider det W = yy, — Yip. You might recognise this from the earlier course on 
Differential Equations as the Wronskian. It’s simple to show, using the Schrödinger 
equation (2.17), that (det WY = 0. This means that det W is independent of x so, in 
particular, det W(x + a) = det W(x). Taking the determinant of (2.19) then tells us 
that det F = 1 as claimed. 


Claim: Tr F is real. 


Proof: We always have the choice pick the original wavefunctions w(x) and w2(x) 
to be entirely real for all x. (If they’re not, simply take the real part and this is also 
a solution to the Schrodinger equation). With this choice, the Floquet matrix itself 
has real elements, and so its trace is obviously real. But the trace is independent of 
our choice of basis of wavefunctions. Any other choice is related by a transformation 
F — AFA“, for some invertible matrix A and this leaves the trace invariant. Hence, 


even if the components of F(E) are complex, its trace remains real. 


To understand the structure of solutions to (2.18), we look at the eigenvalues, à+ and 
A_ of F(E). Of course, these too depend on the energy E of the solutions. Because 
det F = 1, they obey A,A_ = 1. They obey the characteristic equation 

 — (Tr F(E))A+1=0 


The kind of solution that we get depends on whether (Tr F(E))? < 4 or (Tr F(E))? > 4. 
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(Tr F(E))? < 4: In this case, the roots are complex and of equal magnitude. We can 


write 
àp =e and Ae" 


for some k which, assuming that the roots are distinct, lies in the range |k| < a/a. 
To see what this means for solutions to (2.18), we introduce the left-eigenvector of 
(a+, 8+)F = à+ (a+, p+). Then the linear combinations Y+ = a+% + b+Y2 obey 


pale + a) = ep (2) 


These are extended states, spread (on average) equally throughout the lattice. They 
corresponds to the bands in the spectrum. 


(Tr F(E))? > 4: Now the eigenvalues take the form 


yee” and Ag=e 


for some u. The corresponding eigenstates now obey 


pale + a) = e™ yp (x) 


States of this form are not allowed: they are unbounded either as x — +00 or as 
x — —oo. These values of energy E are where the gaps occur in the spectrum. 


We have to work a little harder when F(E) = 4 and the two eigenvalues are de- 
generate, either both +1 or both —1. This situations corresponds to the edge of the 
band. Consider the case when both eigenvalues are +1. Recall from your first course 
on Vectors and Matrices that attempting to diagonalise such a 2 x 2 matrix can result 
in two different canonical forms 


1 _{1 0 “ae 2h 1 0 
PF(E)P ae r) PF(E)P ( ) 


In the former case, there are two allowed solutions. In the latter case, you can check 
that one solution is allowed, while the other grows linearly in z. 


2.1.4 Bloch’s Theorem in One Dimension 


In both models described above, we ended up labelling states by momentum Ak. It’s 
worth pausing to ask: why did we do this? And how should we think of k? 
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Before we get to this, let’s back up and ask an even more basic question: why do 
we label the states of a free particle by momentum? Here, the answer is because 
momentum is conserved. In the quantum theory, this means that the momentum 
operator commutes with the Hamiltonian: |p, H] = 0, so that we can simultaneously 
label states by both energy and momentum. Ultimately, Noether’s theorem tells us 
that this conservation law arises because of translational invariance of the system. 


Now let’s look at our system with a lattice. We no longer have translational invari- 
ance. Correspondingly, in the nearly-free electron model, [p, H] # 0. Hopefully this 
now makes our original question sharper: why do we get to label states by k?! 


While we don’t have full, continuous translational invariance, both the models that 
we discussed do have a discrete version of translational invariance 


xr xrt+a 


As we now show, this is sufficient to ensure that we can label states by something very 
similar to “momentum”. However, the values of this momentum are restricted. This 
result is known as Bloch’s Theorem. Here we prove the theorem for our one-dimensional 
system; we will revisit it in Section 2.3.1 in higher dimensions. 


The Translation Operator 


For concreteness, let’s work with continuous space where states are described by a 
wavefunction w(x). (There is a simple generalisation to discrete situations such as the 
tight-binding model that we describe below.) We introduce the translation operator T, 
as 


TY (a) = b@ +1) 
First note that T; is a unitary operator. To see this, we just need to look at the overlap 
(alu) = | de (0) Tula) = | de oHe +) 
= | ac ge- DW) = f de Paot) 
where, in the step to the second line, we’ve simply shifted the origin. This tells us that 


T = T_;. But clearly T7 = T_; as well, so i = Tr“ and the translation operator is 
unitary as claimed. 
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Next note that the set of translation operators form an Abelian group, 
Ti Ty = Th +t, (2.20) 
with [Tn , Th] = 0. 


The translation operator is a close cousin of the familiar momentum operator 


d 
= —ih— 
d da 
The relationship between the two is as follows: the unitary translation operator is the 


exponentiation of the Hermitian momentum operator 
T = ellp/h 


To see this, we expand the exponent and observe that Tiw(x) = y(x + l) is just a 
compact way of expressing the Taylor expansion of a function 


Tiw(x) = GESHE +) w(x) 


d P @ 
= (Gefi) ylz) = y(x +1) 


We say that the momentum operator is the “generator” of infinitesimal translations. 
A quantum system is said to be invariant under translations by 1 if 
lH, Ti] = 0 (2.21) 


Phrased in this way, we can describe both continuous translational symmetry and dis- 
crete translational symmetry. A system has continuous translational invariance if (2.21) 
holds for all J. In this case, we may equivalently say that [p, H] = 0. Alternatively, 
a system may have discrete translational invariance if (2.21) holds only when / is an 
integer multiple of the lattice spacing a. Now p does not commute with H. 


Let’s look at the case of discrete symmetry. Now we can’t simultaneously diagonalise 
p and H, but we can simultaneously diagonalise 7, and H. In other words, energy 
eigenstates can be labelled by the eigenvalues of Tọ. But Ta is a unitary operator and 
its eigenvalues are simply a phase, e’? for some 0. Moreover, we want the eigenvalues 
to respect the group structure (2.20). This is achieved if we write the eigenvalue of T, 
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0 


as e? = e**! for some k, so that the eigenvalue of Tha coincides with the eigenvalue of 


T”. The upshot is that eigenstates are labelled by some k, such that 
Tarla) = Wy (a + a) = eh, (x) 


Now comes the rub. Because the eigenvalue is a phase, there is an arbitrariness in this 
labelling: states labelled by k have the same eigenvalue under T, as states labelled by 
k + 2r/a. To remedy this, we will simply require that k lies in the range 


ke |-=, =) (2.22) 
We recognise this as the first Brillouin zone. 


This, then, is the essence of physics on a lattice. We can still label states by k, but 
it now lies in a finite range. Note that we can approximate a system with continuous 
translational symmetry by taking a arbitrarily small; in this limit we get the usual 
result k € R. 


This discussion leads us directly to: 


Bloch’s Theorem in One Dimension: In a periodic potential, V(x) = V(x +a), 
there exists a basis of energy eigenstates that can be written as 


prls) = eup(a) 
where u(x) = u(x + a) is a periodic function and k lies in the Brillouin zone (2.22). 


Proof: We take %ą to be an eigenstate of the translation operator Ta, so that 
ve(x + a) = eb, (x). Then uzle +a) = e~*@+9 yy, (x + a) = ey (x) = ugle). 


Bloch’s theorem is rather surprising. One might think that the presence of a periodic 
potential would dramatically alter the energy eigenstates, perhaps localising them in 
some region of space. Bloch’s theorem is telling us that this doesn’t happen: instead 


kz are altered only by a periodic function u(x), sometimes 


the plane wave states e’ 
referred to as a Bloch function, and the fact that the wavenumber is restricted to the 


first Brillouin zone. 


Finally, note that we’ve couched the above discussion in terms of wavefunctions y(x), 
but everything works equally well for the tight-binding model with the translation 
operator defined by T,|n) = |n + 1). 
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Figure 19: The extended zone scheme. Figure 20: The reduced zone scheme. 


Crystal Momentum 


The quantity p = hk is the quantity that replaces momentum in the presence of a 
lattice. It is called the crystal momentum. Note, however, that it doesn’t have the 
simple interpretation of “mass x velocity”. (We will describe how to compute the 
velocity of a particle in terms of the crystal momentum in Section 3.2.1.) 


Crystal momentum is conserved. This becomes particularly important when we 
consider multiple particles moving in a lattice and their interactions. This, of course, 
sounds the same as the usual story of momentum. Except there’s a twist: crystal 
momentum is conserved only mod 27/a. It is perfectly possible for two particles to 
collide in a lattice environment and their final crystal momentum to differ from their 
initial crystal momentum by some multiple of 27/a. Roughly speaking, the lattice 
absorbs the excess momentum. 


This motivates us to re-think how we draw the energy spectrum. Those parts of the 
spectrum that lie outside the first Brillouin zone should really be viewed as having the 
same crystal momentum. To show this, we draw the energy spectrum as a multi-valued 
function of k € [-a/a,a/a). The spectrum that we previously saw in Figure 18 then 
looks like 


The original way of drawing the spectrum is known as the extended zone scheme. 
The new way is known as the reduced zone scheme. Both have their uses. Note that 
edges of the Brillouin zone are identified: k = m /a is the same as k = —7/a. In other 
words, the Brillouin zone is topologically a circle. 
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In the reduced zone scheme, states are labelled by both k € [—a/a,a/a) and an 
integer n = 1,2,... which tells us which band we are talking about. 


2.2 Lattices 


The ideas that we described above all go over to higher dimensions. The key difference 
is that lattices in higher dimensions are somewhat more complicated than a row of 
points. In this section, we introduce the terminology needed to describe different kinds 
of lattices. In Section 2.3, we’ll return to look at what happens to electrons moving in 
these lattice environments. 


2.2.1 Bravais Lattices 


The simplest kind of lattice is called a Bravais lattice. This is a 


e e e e 
periodic array of points defined by integer sums of linearly in- 
dependent basis vectors a;. In two-dimensions, a Bravais lattice e e e e 
A is defined by 
o e e e 
A= {r = nya, + Na » WHE Z} 
Figure 21: 


An obvious example is the square lattice shown to the right. 
We will see further examples shortly. 


In three dimensions, a Bravais lattice is defined by 
A= {r = nya; + N2a2 + 713a3 » WE Z} 


These lattices have the property that any point looks just the same as any other point. 
In mathematics, such an object would simply be called a lattice. Here we add the 
word Bravais to distinguish these from more general kinds of lattices that we will meet 
shortly. 


The basis vectors a; are called primitive lattice vectors. They are not unique. As an 
example, look at the 2-dimensional square lattice below. We could choose basis vectors 
(aı,a2) or (a},a2). Both will do the job. 


e (J e e e e e e 
e (J e e e 
ay ag 
7 
ay ay 
$ e e e e 
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A primitive unit cell is a region of space which, when translated by the primitive 
lattice vectors a;, tessellates the space. This means that the cells fit together, without 
overlapping and without leaving any gaps. These primitive unit cells are not unique. 
As an example, let’s look again at the 2-dimensional square lattice. Each of the three 
possibilities shown below is a good unit cell. 


mi oe 


Each primitive unit cell contains a single lattice point. This is obvious in the second 
and third examples above. In the first example, there are four lattice points associated 
to the corners of the primitive unit cell, but each is shared by four other cells. Counting 
these as a 1/4 each, we see that there is again just a single lattice point in the primitive 
unit cell. 


Although the primitive unit cells are not unique, each has the same volume. It is 
given by 


V = |a - (ag x a3)| (2.23) 


Because each primitive unit cell is associated to a single lattice point, V = 1/n where 
n is the density of lattice points. 


Note finally that the primitive unit cell need not have the full symmetry of the 
lattice. For example, the third possible unit cell shown above for the square lattice is 
not invariant under 90° rotations. 


For any lattice, there is a canonical choice of primitive unit cell that does inherit 
the symmetry of the underlying lattice. This is called the Wigner-Seitz cell, T. (It 
sometimes goes by the name of the Voronoi cell.) Pick a lattice point which we choose 
to be at the origin. The Wigner-Seitz cell is defined is defined to be the region of space 
around such that the origin is the closest lattice point. In equations, 


T= {x: |x| <|x-r| YreAst.r#0} 


The Wigner-Seitz cells for square and triangular lattices are given by 


=A = 


There is a simple way to construct the Wigner-Seitz cell. Draw lines from the origin 
to all other lattice points. For each of these lines, construct the perpendicular bi-sectors; 
these are lines in 2d and planes in 3d. The Wigner-Seitz cell is the inner area bounded 
by these bi-sectors. Here’s another example. 


Examples of Bravais Lattices in 2d 


Let’s look at some examples. In two dimensions, a Bravais lattice is defined by two 
non-parallel vectors a; and ag, with angle 0 # 0 between them. However, some of these 
lattices are more special than others. For example, when |a;| = |a| and 0 = 7/2, the 
lattice is square and enjoys an extra rotational symmetry. 


We will consider two Bravais lattices to be equivalent if they share the same symmetry 
group. With this definition, there are five possible Bravais lattices in two dimensions. 
They are 


e Square: |aı| = |a2| and 0 = 7/2. It has four-fold rotation symmetry and 
reflection symmetry. 


e Triangular: |a| = |a2| and @ = 7/3 or 0 = 27/3. This is also sometimes called 
a hexagonal lattice. It has six-fold rotation symmetry. 


e Rectangular: |a| 4 |a2| and 0 = 7/2. This has reflection symmetry. 


e Centred Rectangular: |a;| 4 |a2| and 0 # 7/2, but the primitive basis vectors 
should obey (2a, — a;)-a; = 0. This means that the lattice looks like a rectangle 
with an extra point in the middle. 
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e Oblique: |a;| 4 |a| and nothing special. This contains all other cases. 


The square, triangular and oblique lattices were shown on the previous page where 
we also drew their Wigner-Seitz cells. 


Not all Lattices are Bravais 


Not all lattices of interest are Bravais lattices. One particularly important lattice in 
two dimensions has the shape of a honeycomb and is shown below. 


This lattice describes a material called graphene that we will describe in more detail 
in Section 3.1.3. The lattice is not Bravais because not all points are the same. To see 
this, consider a single hexagon from the lattice as drawn below. 


Each of the red points is the same: each has a neighbour 
directly to the left of them, and two neighbours diagonally to 
the right. But the white points are different. Each of them has 
a neighbour directly to the right, and two neighbours diagonally 
to the left. 


Lattices like this are best thought by decomposing them into 


groups of atoms, where some element of each group sits on the Figure 22: 


vertices of a Bravais lattice. For the honeycomb lattice, we can 
consider the group of atoms O———®. The red vertices form a triangular lattice, with 
primitive lattice vectors 


3a 3 
fet: a a 
Meanwhile, each red vertex is accompanied by a white vertex which is displaced by 
d = (—a,0) 


This way we build our honeycomb lattice. 


— 49 — 


This kind of construction generalises. We can describe any 


O 
lattice as a repeating group of atoms, where each group sits on an J 
underlying Bravais lattice A. Each atom in the group is displaced 


from the vertex of the Bravais lattice by a vector d;. Each group of Ng 
atoms, labelled by their positions d; is called the basis. For example, O 


for the honeycomb lattice we chose the basis dı = 0 for red atoms 


j i ys Fi 23: 
and də = d for white atoms, since the red atoms sat at the positions Ipure’ 23 


of the underlying triangular lattice. In general there’s no require- 
ment that any atom sits on the vertex of the underlying Bravais lattice. The whole 
lattice is then described by the union of the Bravais lattice and the basis, U;{A + d;}. 


Examples of Bravais Lattices in 3d 


It turns out that there are 14 different Bravais lattices in three dimensions. Fortunately 
we won’t need all of them. In fact, we will describe only the three that arise most 
frequently in Nature. These are: 


e Cubic: This is the simplest lattice. The primitive lattice vectors are aligned with 
the Euclidean axes 


a, = ax s a2 = ay 7 a3 = az 


And the primitive cell has volume V = a. The Wigner-Seitz cell is also a cube, 
centered around one of the lattice points. 


e Body Centered Cubic (BCC): This is a cubic lattice, with an extra point 
placed at the centre of each cube. We could take the primitive lattice vectors to 


be 
x A a. ‘ x 
aj =ax , ag=ay , as = 5 ( +y +2) 
However, a more symmetric choice is 
a A x N a,, a 7 a, A à 
aı = 5 (“x+y +2) f a = 5 (- 9+2) ; as = 5 (x+y — 4) 


The primitive unit cell has volume V = a?/2. 
The BCC lattice can also be thought of as a cubic lattice, with a basis of two 


atoms with dı = 0 and də = x +9 +2). However, this doesn’t affect the fact 
that the BCC lattice is itself Bravais. 
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Cubic 


Figure 24: Three Bravais lattices. The different coloured atoms are there in an attempt to 
make the diagrams less confusing; they do not denote different types of atoms. 


The Alkali metals (Li, Na, K, Rb, Cs) all have a BCC structure, as do the 
Vanadium group (V, Nb, Ta) and Chromium group (Cr, Mo, W) and Iron (Fe). 
In each case, the lattice constant is roughly a œ 3 to 6 x 107! m. 


e Face Centered Cubic (FCC): This is again built from the cubic lattice, now 
with an extra point added to the centre of each face. The primitive lattice vectors 
are 

a a 


ar = 5 (f +2) , a =z (R+) , ag= 


The primitive unit cell has volume V = a?/4. 


The FCC lattice can also be thought of as a cubic lattice, now with a basis of 
four atoms sitting at dı = 0, d2 = $(x + y), dg = $(k +2) and dy = $(y +2). 
Nonetheless, it is also a Bravais lattice in its own right. 


Examples of FCC structures include several of the Alkaline earth metals (Be, 
Ca, Sr), many of the transition metals (Sc, Ni, Pd, Pt, Rh, Ir, Cu, Ag, Au) 
and the Noble gases (Ne, Ar, Kr, Xe) when in solid form, again with a = 3 to 
6 x 10~!° m in each case. 


The Wigner-Seitz cells for the BCC and FCC lattices are polyhedra, sitting inside a 
cube. For example, the Wigner-Seitz cell for the BCC lattice is shown in the left-hand 
figure. 


Examples of non-Bravais Lattices in 3d 


As in the 2d examples above, we can describe non-Bravais crystals in terms of a basis 
of atoms sitting on an underlying Bravais lattice. Here are two particularly simple 
examples. 
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Figure 25: Wigner-Seitz cell for BCC Figure 26: Salt. 


Diamond is made up of two, interlaced FCC lattices, with carbon atoms sitting at 
the basis points dı = 0 and də = rae +9 +2). Silicon and germanium also adopt this 
structure. 


Another example is salt (NaCl). Here, the basic structure is a cubic lattice, but with 
Na and Cl atoms sitting at alternate sites. It’s best to think of this as two, interlaced 
FCC lattices, but shifted differently from diamond. The basis consists of a Na atom 
at d = 0 and a Cl atom at d = §(k + +2). This basis then sits on top of an FCC 
lattice. 


2.2.2 The Reciprical Lattice 


Given a Bravais lattice A, defined by primitive vectors a;, the reciprocal lattice A* is 
defined by the set of points 


A* = {k= nibi , ni € Z} 
where the new primitive vectors b; obey 
ai b; = 27 Oj (2.24) 


A* is sometimes referred to as the dual lattice. In three dimensions, we can simply 
construct the lattice vectors b; by 


2m 1 
b; = y ack aj X ak 
where V is the volume of unit cell of A (2.23). We can also invert this relation to get 
2r 1 
ai Ve ik b; X bk 


where V* = |b, - (bz x b3)| = (27)?/V is the volume of T*, the unit cell of A*. Note 
that this shows that the reciprocal of the reciprocal lattice gives you back the original. 
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The condition (2.24) can also be stated as the requirement that 
e*¥=1 VreA, ked (2.25) 
which provides an alternative definition of the reciprocal lattice. 


Here are some examples: 


e The cubic lattice has ay = aX, a2 = ay and a3 = az. The reciprocal lattice is 
also cubic, with primitive vectors bı = (27/a)x, by = (2a/a)y and b = (27/a)z 


e The BCC lattice has a; = $(—kK+ y+), ag = $(K-y+z) and az = $(k+y—z2Z). 
The reciprocal lattice vectors are by = (27/a)(y + Z), by = (2a/a)(K +2) and 
bs = (27/a)(x+y). But we’ve seen these before: they are the lattice vectors for 
a FCC lattice with the sides of the cubic cell of length 47/a. 


We see that the reciprocal of a BCC lattice is an FCC lattice and vice versa. 


The Reciprocal Lattice and Fourier Transforms 


The reciprocal lattice should not be thought of as sitting in the same space as the 
original. This follows on dimensional grounds. The original lattice vectors a; have 
the dimension of length, [a;] = L. The definition (2.24) then requires the dual lattice 
vectors b; to have dimension [b;] = 1/L. The reciprocal lattice should be thought of 
as living in Fourier space which, in physics language, is the same thing as momentum 
space. As we’ll now see, the reciprocal lattice plays an important role in the Fourier 
transform. 


Consider a function f(x) where, for definiteness, we'll take x € R®. Suppose that 
this function has the periodicity of the lattice A, which means that f(x) = f(x+r) for 
all r € A. The Fourier transform is 


f(k) = pe ee f(x) = D e OF") f(x + r) 


reA 
= aad f dx e ™* f(x) (2.26) 
reA p 


In the second equality, we have replaced the integral over R? with a sum over lattice 
points, together with an integral over the Wigner-Seitz cell I. In going to the second 
line, we have used the periodicity of f(x). We see that the Fourier transform comes 
with the overall factor 
Alkj= $ ea (2.27) 
red 


This is an interesting quantity. It has the following property: 
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Claim: A(k) = 0 unless k € A*. 


Proof: Since we’re summing over all lattice sites, we could equally well write A(k) = 
Xea @ UX -T9) for any ro € A. This tells us that A(k) = e° A(k) for any ro € A. 
This means that A(k) = 0 unless e*?° = 1 for all rọ € A. But this is equivalent to 
saying that A(k) = 0 unless k € A*. 


In fact, we can get a better handle on the function (strictly, a distribution) A(k). 
We have 
Claim: A(k) = V* Jaca O(k — q). 


Proof: We can expand k = 5° ,k,b;, with k; € R, and r = $0, n;a; with n; € Z. 
Then, using (2.24), we have 


A(k) = o(ky)o(k2)o(k3) where oa(k) = > = 
The range of the sum in o(k) is appropriate for an infinite lattice. If, instead, we had 
a finite lattice with, say, N + 1 points in each direction, (assume, for convenience, that 
N is even), we would replace o(k) with 


N/2 —2ri mi A 
on(k) = >D en 2mikn £ - = a = ee ae 
e727tk — ] sin tk 
n=—N/2 


This function is plotted on the right for —1/2 < 
k < 1/2. We have chosen a measly N = 10 in this 
plot, but already we see that the function is heay- 
ily peaked near the origin: when k ~ O(1/N), then 
on(k) ~ O(N). As N > œ, this peak becomes 
narrower and taller and the area under it tends to- 


10- 


On (k) 


1 
N o n a o œ 
T 1 T T T T 


wards 1. To see this last point, replace sin(rk) ~ mk e 02 04 
and use the fact that a sin(x)/x = m. This shows 
that the peak near the origin tends towards a delta Figure 27: 


function. 


The function oy(k) is periodic. We learn that, for large N, oy(k) just becomes a 
series of delta functions, restricting k to be integer valued 


lim øx(k) = 2; b(k — n) 
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Looking back at (2.27), we see that these delta functions mean that the Fourier trans- 
form is only non-vanishing when k = )°,k;b; with k; € Z. But this is precisely the 
condition that k lies in the reciprocal lattice. We have 
A(k) = ġe =V* X` 5(k-q) (2.28) 
reA qe A* 
We can understand this formula as follows: if k € A*, then e~**? = 1 for all r € A and 
summing over all lattice points gives us infinity. In contrast, if k ¢ A*, then the phases 


ek oscillate wildly for different r and cancel each other out. 


The upshot is that if we start with a continuous function f(x) with periodicity A, 
then the Fourier transform (2.26) has support only at discrete points A*, 
f(k) = A(k)S(k) with S(k) = f dx e~*** f(x) 

r 

Here S(k) is known as the structure factor. Alternatively, inverting the Fourier trans- 
form, we have 

1 
(27)? 


This tells us that any periodic function is a sum of plane waves whose wavevectors lie 


ie) = 


f dk e** f(k) = aE XO e4* S(q) (2.29) 


qeA* 


on the reciprocal lattice, We’ll revisit these ideas in Section 2.4 when we discuss x-ray 
scattering from a lattice. 


2.2.3 The Brillouin Zone 


The Wigner-Seitz cell of the reciprocal lattice is called the Brillouin zone. 


We already saw the concept of the Brillouin zone in our one-dimensional lattice. 
Let’s check that this coincides with the definition given above. The one-dimensional 
lattice is defined by a single number, a, which determines the lattice spacing. The 
Wigner-Seitz cell is defined as those points which lie closer to the origin than any 
other lattice point, namely r € [—a/2,a/2). The reciprocal lattice is defined by (2.24) 
which, in this context, gives the lattice spacing b = 27/a. The Wigner-Seitz cell of this 
reciprocal lattice consists of those points which lie between [—b/2,b/2) = |—a/a, T/a). 
This coincides with what we called the Brillouin zone in Section 2.1. 


The Brillouin zone is also called the first Brillouin zone. As it is the Wigner-Seitz 
cell, it is defined as all points in reciprocal space that are closest to a given lattice point, 
say the origin. The n'è Brillouin zone is defined as all points in reciprocal space that 
are n‘* closest to the origin. All these higher Brillouin zones have the same volume as 
the first. 
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Figure 28: The Brillouin zones for a 2d square lattice. The first is shown in yellow, the 
second in pink, the third in blue. 


We can construct the Brillouin zone boundaries by drawing the perpendicular bisec- 
tors between the origin and each other point in A*. The region enclosing the origin is 
the first Brillouin zone. The region you can reach by crossing just a single bisector is 
the second Brillouin zone, and so on. In fact, this definition generalises the Brillouin 
zone beyond the simple Bravais lattices. 


As an example, consider the square lattice in 2d. The reciprocal lattice is also square. 
The first few Brillouin zones on this square lattice are shown in Figure 28. 


For the one-dimensional lattice that we looked at in Section 2.1, we saw that the 
conserved momentum lies within the first Brillouin zone. This will also be true in 
higher dimensions. This motivates us to work in the reduced zone scheme, in which these 
higher Brillouin zones are mapped back into the first. This is achieved by translating 
them by some lattice vector. The higher Brillouin zones of the square lattice in the 
reduced zone scheme are shown in Figure 29. 


Finally, note that the edges of the Brillouin zone should be identified; they label 
the same momentum state k. For one-dimensional lattices, this results in the Brillouin 
zone having the topology of a circle. For d-dimensional lattices, the Brillouin zone is 
topologically a torus T?. 
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Figure 29: The first three Brillouin zones for a square lattice in the reduced zone scheme. 


Crystallographic Notation 


The Brillouin zone of real materials is a three-dimensional space. We often want to 
describe how certain quantities — such as the energy of the electrons — vary as we 
move around the Brillouin zone. To display this information graphically, we need to 
find a way to depict the underlying Brillouin zone as a two-dimensional, or even one- 
dimensional space. Crystallographers have developed a notation for this. Certain, 
highly symmetric points in the Brillouin zone are labelled by letters. From the letter, 
you’re also supposed to remember what underlying lattice we’re talking about. 


For example, all Brillouin zones have an origin. The concept of an “origin” occurs in 
many different parts of maths and physics and almost everyone has agreed to label it 
as “0”. Almost everyone. But not our crystallographer friends. Instead, they call the 
origin I. 


From hereon, it gets more bewildering although if you stare at enough of these you 
get used to it. For example, for a cubic lattice, the centre of each face is called X, 
the centre of each edge is M while each corner is R. Various labels for BCC and FCC 
lattices are shown in Figure 30 


2.3 Band Structure 


“When I started to think about it, I felt that the main problem was to 
explain how the electrons could sneak by all the ions in a metal.... I found 
to my delight that the wave differed from a plane wave of free electron only 
by a periodic modulation.This was so simple that I didn’t think it could be 
much of a discovery, but when I showed it to Heisenberg he said right away, 
‘That’s it.”.” Felix Bloch 


Now that we’ve developed the language to describe lattices in higher dimensions, it’s 
time to understand how electrons behave when they move in the background of a fixed 
lattice. We already saw many of the main ideas in the context of a one-dimensional 
lattice in Section 2.1. Here we will describe the generalisation to higher dimensions. 


TEE 


= L xX 
N u 
r x», yE x 
bi” M ” KW 
Cubic BCC FCC 


Figure 30: The labels for various special points on the Brillouin zone. 


2.3.1 Bloch’s Theorem 


Consider an electron moving in a potential V(x) which has the periodicity of a Bravais 
lattice A, 


V(x+r)=V(x) foralreA 
Bloch’s theorem states that the energy eigenstates take the form 
Wy (x) = e™** u(x) 
where ux(x) has the same periodicity as the lattice, ux(x + r) = ux(x) for all r € A. 


There are different ways to prove Bloch’s theorem. Here we will give a simple proof 
using the ideas of translation operators, analogous to the one-dimensional proof that 
we saw in Section 2.1.4. Later, in Section 2.3.2, we will provide a more direct proof by 
decomposing the Schrodinger equation into Fourier modes. 


Our starting point is that the Hamiltonian is invariant under discrete translations 
by the lattice vectors r € A. As we explained in Section 2.1.4, these translations are 
implemented by unitary operators T}. These operators form an Abelian group, 


TeTe = Tre! (2.30) 


and commute with the Hamiltonian: [H,T,] = 0. This means that we can simultane- 
ously diagonalise H and T}, so that energy eigenstates are labelled also labelled by the 
eigenvalue of each T}. Because T, is unitary, this is simply a phase. But we also have 
the group structure (2.30) that must be respected. Suppose that translation of a given 
eigenstate by a basis element a; gives eigenvalue 


Ta, (x) = (x + a;) = eb (x) 
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Then translation by a general lattice vector r = n;a; must give 
Teh(x) = W(x + r) = ei ™%ah(x) = eG (x) 


where the vector k is defined by k- a; = 6;. In other words, we can label eigenstates of 
T, by a vector k. They obey 


TW(X) = p(x + r) = e y(x) 


Now we simply need to look at the function uk(x) = e~**y,(x). The statement 
of Bloch’s theorem is that ug(x) has the periodicity of A which is indeed true, since 
uy, (x + r) = eže Ehx + r) = e™** yy, (x) = u_(x). 


Crystal Momentum 


The energy eigenstates are labelled by the wavevector k, called the crystal momentum. 
There is an ambiguity in the definition of this crystal momentum. This is not the same 
as the true momentum. The energy eigenstates do not have a well defined momentum 
because they are not eigenstates of the momentum operator p = —ihV unless ug(x) is 
constant. Nonetheless, we will see as we go along that the crystal momentum plays a 
role similar to the true momentum. For this reason, we will often refer to k simply as 
“momentum”. 


There is an ambiguity in the definition of the crystal momentum. Consider a state 
with a crystal momentum k’ = k + q, with q € A* a reciprocal lattice vector. Then 


Wei (x) = elk Xe'4X1y) (x) = et, (x) 


where ti,(x) = e’4*u,,(x) also has the periodicity of A by virtue of the definition of the 
reciprocal lattice (2.25). 


As in the one-dimensional example, we have different options. We could choose to 
label states by k which lie in the first Brillouin zone. In this case, there will typically be 
many states with the same k and different energies. This is the reduced zone scheme. In 
this case, the energy eigenstates are labelled by two indices, Yk, n, where k is the crystal 
momentum and n is referred to as the band index. (We will see examples shortly.) 


Alternatively, we can label states by taking any k € R? where d is the dimension of 
the problem. This is the extended zone scheme. In this case that states labelled by k 
which differ by A* have the same crystal momenta. 
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2.3.2 Nearly Free Electrons in Three Dimensions 


Consider an electron moving in R in the presence of a weak potential V(x). We’ll 
assume that this potential has the periodicity of a Bravais lattice A, so 


V(x) =V(x+r) forallreA 


We treat this potential as a perturbation on the free electron. This means that we start 
with plane wave states |k) with wavefunctions 


(x|k) ay ekx 


with energy E(k) = hk?/2m. We want to see how these states and their energy levels 
are affected by the presence of the potential. The discussion will follow closely the one- 
dimensional case that we saw in Section 2.1.2 and we only highlight the differences. 


When performing perturbation theory, we’re going to have to consider the potential 
V(x) sandwiched between plane-wave states, 


1 3 iK 
k|V(x)|k') = dag Se 
KVK) = fda e (x) 
However, we’ve already seen in (2.29) that the Fourier transform of a periodic function 
can be written as a sum over wavevectors that lie in the reciprocal lattice A*, 


yV(x)= 5 eax Vj 
qeA* 
(Note: here Vq is the Fourier component of the potential and should not be confused 
with the volumes of unit cells which were denoted as V and V* in Section 2.2.) This 
means that (k|V (x)|k’) is non-vanishing only when the two momenta differ by 


k-k =q qeA* 


This has a simple physical interpretation: a plane wave state |k) can scatter into 
another plane wave state |k’) only if they differ by a reciprocal lattice vector. In other 
words, only momenta q, with q € A*, can be absorbed by the lattice. 


Another Perspective on Bloch’s Theorem 


The fact that that a plane wave state |k) can only scatter into states |k — q), with q € 
A*, provides a simple viewpoint on Bloch’s theorem, one that reconciles the quantum 
state with the naive picture of the particle bouncing off lattice sites like a ball in a 
pinball machine. Suppose that the particle starts in some state |k). After scattering, 
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we might expect it to be some superposition of all the possible scattering states |k — q). 
In other words, 


i(k—q)-x 
U(X) = ` g =a) Ck-q 
qEA* 
for some coefficients Ck-q. We can write this as 
ik-x —iq: x ik-x 
(x) =e J e™TX Gg =e US) 
qeA* 


where, by construction, u,(x + r) = ux(x) for all r € A. But this is precisely the form 
guaranteed by Bloch’s theorem. 


Although the discussion here holds at first order in perturbation theory, it is not 
hard to extend this argument to give an alternative proof of Bloch’s theorem, which 
essentially comes down to analysing the different Fourier modes of the Schrodinger 
equation. 


Band Structure 


Let’s now look at what becomes of the energy levels after we include the perturbation. 
We will see that, as in the 1d example, they form bands. The resulting eigenstates 
Wk n(X) and their associated energy levels E,,(k) are referred to as the band structure 
of the system. 


Low Momentum: Far from the edge of the Brillouin zone, the states |k) can only 


scatter into states |k + q) with greatly different energy. In this case, we can work with 
non-degenerate perturbation theory to compute the corrections to the energy levels. 


On the Boundary of the Brillouin zone: Things get more interesting when we have 


to use degenerate perturbation theory. This occurs whenever the state |k) has the same 
energy as another state |k + q) with q € A’, 


E(k) = Ex(k+q) => k? = (k+q)’ => 2k-q+¢@ =0 
This condition is satisfied whenever we can write 
1 
k = —- k 
54 +k] 


where q -kı = 0. This is the condition that we sit on the perpendicular bisector of 
the origin and the lattice point —q € A*. But, as we explained in Section 2.2.3, these 
bisectors form the boundaries of the Brillouin zones. We learn something important: 
momentum states are degenerate only when they lie on the boundary of a Brillouin 
zone. This agrees with what we found in our one-dimensional example in Section 2.1.2. 
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Figure 31: Energy contours for nearly-free electrons in the first Brillouin zone. 


We know from experience what the effect of the perturbation V(x) will be: it will lift 
the degeneracy. This means that a gap opens at the boundary of the Brillouin zone. For 
example, the energy of states just inside the first Brillouin zone will be pushed down, 
while the energy of those states just outside the first Brillouin zone will be pushed up. 
Note that the size of this gap will vary as we move around the boundary.. 


There is one further subtlety that we should mention. At a generic point on the 
boundary of the Brillouin zone, the degeneracy will usually be two-fold. However, at 
special points — such as edges, or corners — it is often higher. In this case, we must 
work with all degenerate states when computing the gap. 


All of this is well illustrated with an example. However, it’s illustrated even better if 
you do the example yourself! The problem of nearly free electrons in a two-dimensional 
square lattice is on the problem sheet. The resulting energy contours are shown in 
Figure 31. 


Plotting Band Structures in Three Dimensions 


For three-dimensional lattice, we run into the problem of depicting the bands. For this, 
we need the crystallographer’s notation we described previously. The spectrum of free 
particles (i.e. with no lattice) is plotted in the Brillouin zone of BCC and FCC lattices 
in Figure 321. 


We can then compare this to the band structure of real materials. The dispersion 
relation for silicon is also shown in Figure 32. This has a diamond lattice structure, 
which is plotted as FCC. Note that you can clearly see the energy gap of around 1.1 eV 
between the bands. 


‘Images plotted by Jan-Rens Reitsma, from Wikimedia commons. 
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Figure 32: Free band structure (in red) for BCC and FCC, together with the band structure 
for silicon, exhibiting a gap. 


How Many States in the Brillouin Zone? 


The Brillouin zone consists of all wavevectors k that lie within the Wigner-Seitz cell of 
the reciprocal lattice A*. How many quantum states does it hold? Well, if the spatial 
lattice A is infinite in extent then k can take any continuous value and there are an 
infinite number of states in the Brillouin zone. But what if the spatial lattice is finite 
in size? 


In this section we will count the number of quantum states in the Brillouin zone of 
a finite spatial lattice A. We will find a lovely answer: the number of states is equal to 
N, the number of lattice sites. 


Recall that the lattice A consists of all vectors r = }>,n;a; where a; are the primitive 
lattice vectors and n; € Z. For a finite lattice, we simply restrict the value of these 
integers to be 


0<n; < N; 


for some N;. The total number of lattice sites is then N = N,N2N3 (assuming a three- 
dimensional lattice). The total volume of the lattice is VN where V = |a; - (ag x as)| 
is the volume of the unit cell. 


The basic physics is something that we’ve met before: if we put a particle in a 
box, then the momentum hk becomes quantised. This arises because of the boundary 
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conditions that we place on the wavefunction. It’s simplest to think about a finite, 
periodic lattice where we require that the wavefunction inherits this periodicity, so 
that 


w(x + Nia;) = y(x) for each i = 1,2,3 (2.31) 


But we know from Bloch’s theorem that energy eigenstates take the form (x) = 
e*kXu. (x) where ux(x + a;) = ux (x). This means that the periodicity condition (2.31) 


becomes 


iNjk-a; Mi 
RE a > k b; 


where m; € Z and b; are the primitive vectors of the reciprocal lattice defined in (2.24). 
This is sometimes called the Born-von Karmen boundary condition. 


This is the quantisation of momentum that we would expect in a finite system. The 
states are now labelled by integers m; € Z. Each state can be thought of as occupying 
a volume in k-space, given by 

|b; : (bə x bs)| E V* 
N,N2N3 N 


where V* is the volume of the Brillouin zone. We see that the number of states that 


live inside the Brillouin zone is precisely N, the number of sites in the spatial lattice. 


2.3.3 Wannier Functions 


Bloch’s theorem tells that the energy eigenstates can be written in the form 
P(x) = e'*uy (x) 


with k lying in the first Brillouin zone and u,(x) a periodic function. Clearly these are 
delocalised throughout the crystal. For some purposes, it’s useful to think about these 
Bloch waves as arising from the sum of states, each of which is localised at a given 
lattice site. These states are called Wannier functions; they are defined as 


1 —ik-r 
w(x) = Ta pee W(X) (2.32) 


where the sum is over all k in the first Brillouin zone. 


The basic idea is that the Wannier wavefunction w,(x) is localised around the lattice 
site r € A. Indeed, using the periodicity properties of the Bloch wavefunction, it’s 
simple to show that wrr (xX +r’) = w,(x), which means that we can write w(x) = 
w(x—r). 
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The Wannier functions aren’t unique. We can always do a phase rotation u(x) > 
eX). (k) in the definition (2.32). Different choices of x(k) result in differing amounts 
of localisation of the state w,(x) around the lattice site r. 


We can invert the definition of the Wannier function to write the original Bloch 
wavefunction as 


l ik-r 
U(x) = YN > eT w(x —r) (2.33) 


recA 
which follows from (2.28). 


The Wannier functions have one final, nice property: they are orthonormal in the 
sense that 


i Ba w*(x—r'jw(x — r) ~ f Ba yer abt (x) U(X) 


k,k’ 
i ik-(r/—r) 1 
=) e r" = (r —r') 
Lir 


where, in going to the second line, we have used the orthogonality of Bloch wave- 


functions for different k (which, in turn, follows because they are eigenstates of the 
Hamiltonian with different energies). 


2.3.4 Tight-Binding in Three Dimensions 


We started our discussion of band structure in Section 2.1.1 with the one-dimensional 
tight binding model. This is a toy Hamiltonian describing electrons hopping from one 
lattice site to another. Here we’ll look at this same class of models in higher dimensional 
lattices. 


We assume that the electron can only sit on a site of the lattice r € A. The Hilbert 
space is then spanned by the states |r) with r € A. We want to write down a Hamilto- 
nian which describes a particle hopping between these sites. There are many different 
ways to do this; the simplest is 


H =Y Bolt) (e| — So tee (le) + Ie’) (rl) 
reA <rr’> 


where the label < rr’ > means that we only sum over pairs of sites r and r’ which 
are nearest neighbours in the lattice. Alternatively, if these nearest neighbours are 
connected by a set of lattice vectors a, then we can write this as 


1-5 


reA 


(2.34) 


Eolr) (r| — $ ta lr) {r + al 
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Note that we’ve just got one term here, since if |r + a) is a nearest neighbour, then so 
is |r — a). The Hamiltonian is Hermitian provided ta = t-a This Hamiltonian is easily 
solved. The eigenstates take the form 


I(k)) = x Se |r) (2.35) 


reA 


where N is the total number of lattice sites. It’s simple to check that these states 
satisfy H|w(k)) = E(k)|w(k)) with 


E(k) = Ey — > 2ta cos(k - a) (2.36) 


where the factor of 1/2 is there because we are still summing over all nearest neighbours, 
including +a. This exhibits all the properties of that we saw in the tight-binding 
model. The energy eigenstates (2.35) are no longer localised, but are instead spread 
throughout the lattice. The states form just a single band labelled, as usual, but by 
crystal momentum k lying in the first Brillouin zone. This is to be expected in the 
tight-binding model as we start with N states, one per lattice site, and we know that 
each Brillouin zone accommodates precisely N states. 


As a specific example, consider a cubic lattice. The nearest neighbour lattice sites 
area € { (+a, 0,0), (0, +a, 0), (0, 0, +a)} and the hopping parameters are the same in 
all directions: ta = t. The dispersion relation is then given by 


E(k) = Eo — 2t ( cos(kza) + cos(kya) + cos(kza)) (2:37) 
The width of this band is AF = Emax — Emin = 12t. 


Note that for small k, the dispersion relation takes the form of a free particle 


21,2 
TE Paas 


where the effective mass m* is determined by various parameters of the underlying 


E(k) = constant + 


lattice, m* = h?/2ta?. However, at higher k the energy is distorted away from the that 
of a free particle. For example, you can check that k, + ky = Fr/a (with k, = 0) is a 
line of constant energy. 


2.3.5 Deriving the Tight-Binding Model 


Above, we have simply written down the tight-binding model. But it’s interesting to 
ask how we can derive it from first principles. In particular, this will tell us what 
physics it captures and what physics it misses. 
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To do this, we start by considering a single atom which we place at the origin. The 


Hamiltonian for a single electron orbiting this atom takes the familiar form 
2 
p 
Ff cow = on + Vatom(X) 


The electrons will bind to the atom with eigenstates ¢,,(x) and discrete energies €n < 0, 


which obey 
V 
HatomPn(X) = EnPn(xX) 
A sketch of a typical potential Vatom(x) and the binding energies 
€n is Shown on the right. There will also be scattering states, with = 
energies € > 0, which are not bound to the atom. 
Our real interest lies in a lattice of these atoms. The resulting Figure 33: 


potential is 


Viattice(X) = yo Vatom (x = r) 


reA 


This is shown in Figure 34 for a one-dimensional lattice. What happens to the energy 
levels? Roughly speaking, we expect those electrons with large binding energies — 
those shown at the bottom of the spectrum — to remain close to their host atoms. But 
those that are bound more weakly become free to move. This happens because the tails 
of their wavefunctions have substantial overlap with electrons on neighbouring atoms, 
causing these states to mix. This is the physics captured by the tight-binding model. 


The weakly bound electrons which become dislodged from their host atoms are called 
valence electrons. (These are the same electrons which typically sit in outer shells and 
give rise to bonding in chemistry.) As we’ve seen previously, these electrons will form 
a band of extended states. 


Let’s see how to translate this intuition into equations. We want to solve the Hamil- 
tonian 
2 


H = 5 + Viatice(X) (2.38) 


Our goal is to write the energy eigenstates in terms of the localised atomic states n(x). 
Getting an exact solution is hard; instead, we’re going to guess an approximate solution. 


6 /.= 
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Figure 34: Extended and localised states in a lattice potential. 


First, let’s assume that there is just a single valence electron with localised wave- 
function $(x) with energy € . We know that the eigenstates of (2.38) must have Bloch 
form. We can build such a Bloch state from the localised state (x) by writing 


1 ikr 

He) = Fe 2 o(x —r) (2.39) 
where N is the number of lattice sites. This is a Bloch state because for any a € A, 
we have Yyk(x + a) = e™a y(x). Note that this is the same kind of state (2.35) that 
solved our original tight-binding model. Note also that this ansatz takes the same 
form as the expansion in terms of Wannier functions (2.33). However, in contrast to 
Wannier functions, the wavefunctions $(x) localised around different lattice sites are 
not orthogonal. This difference will be important below. 


The expected energy for the state (2.39) is 


(Vk| A |Yk) 
E(k) = ——— 
(o) (W/V) 
First, the denominator. 
oe 1 ik: (r/—r) 3 * 1 
(adh) = = D Om? | da rear) 
rr/EA 
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3 
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ss 


where, in going to the second line, we’ve used the translational invariance of the lattice. 
The function a(r) measures the overlap of the wavefuntions localised at lattice sites 
separated by r. 
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Next the numerator. To compute this, we write H = Hatom + AV (x) where 
AV (x) = Viattice(X) — Vtom = Vatom( (x = r) 
reA,r4~0 
We then have 
(ulin) = 5p Do OP faba g- Haon + AVAE r’) 


IN ek 


Soe f dx g(x = r) (Huom + AV) (x) 


reA 


= (ule) + Ae + > e**4(r) 
r#0 


Here Ac is the shift in the energy of the bound state ¢(x) due to the potential AV, 
Ac = fèr ¢*(x)AV (x)(x) 


Meanwhile, the last term arises from the overlap of localised atoms on different sites 


ve) = f dr 6" - 2) AV (x) 
The upshot of this is an expression for the expected energy of the Bloch wave (2.39) 


Ae + ai e—ikr a(r) 
Le eye alr) 


Under the assumption that a(r) < 1, we can expand out the denominator (1+2)~! = 


E(k) =e4 


1 — z, and write 
E(k) = e+ Ae + ve (1) — a(r) Ac) (2.40) 
r40 


This still looks rather complicated. However, the expression simplifies because the 
overlap functions a(r) and y(r) both drop off quickly with separation. Very often, it’s 
sufficient to take these to be non-zero only when r are the nearest neighbour lattice 
sites. Sometimes we need to go to next-to-nearest neighbours. 


An Example: s-Orbitals 


Let’s assume that a(r) and 7(r) are important only for r connecting nearest neighbour 
lattice sites; all others will be taken to vanish. We’ll further take the valence electron 
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to sit in the s-orbital. This has two consequences: first, the localised wavefunction is 
rotationally invariant, so that (r) = ¢(r). Second, the wavefunction can be taken to 
be real, so ¢*(x) = @(x). With these restrictions, we have 


a(r) = I Pxg(x — 1)6(x) = a(—r) 


We want a similar expression for y(r). For this, we need to make one further assump- 
tion: we want the crystal to have inversion symmetry. This means that V(x) = V(—x) 
or, more pertinently for us, AV(x) = AV(—x). We can then write 


y(r) = / dx 6(x — r)AV(x)o(x) 
Š f dbx! 6(—x! —r)AV(-x')6(—x’) 


7 | Pa! d(|x! +r) AV (x(x) 
= y(—r) 


where we have defined x’ = —x in the second line and used both the inversion symmetry 
and rotational invariance of the s-orbital in the third. Now we can write the energy 
(2.40) in a slightly nicer form. We need to remember that the vectors r span a lattice 
which ensures that if r is a nearest neighbour site then —r is too. We then have 


E(k) = e + Ae + Ñ` cos(k - a) (xa) = Aca(a)) (2.41) 


where a are the nearest neighbour lattice sites. We recognise this as the dispersion 
relation that we found in our original tight-binding model (2.36), with Ey = € + Ae and 
ta = y(a) — Aea(a). 


So far we’ve shown that the state (2.39) has the same energy as eigenstates of the 
tight-binding Hamiltonian. But we haven’t yet understood when the state (2.39) is a 
good approximation to the true eigenstate of the Hamiltonian (2.38). 


We can intuit the answer to this question by looking in more detail at (2.41). We 
see that the localised eigenstates ¢(x), each of which had energy <€, have spread into a 
band with energies E(k). For this calculation to be valid, it’s important that this band 
doesn’t mix with other states. This means that the energies E(k) shouldn’t be too 
low, so that it has overlap with the energies of more deeply bound states. Nor should 
E(k) be too high, so that it overlaps with the energies of the scattering states which 
will give rise to higher bands. If the various lattice parameters are chosen so that it 
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sits between these two values, our ansatz (2.39) will be a good approximation to the 
true wavefunction. Another way of saying this is that if we focus on states in the first 
band, we can approximate the Hamiltonian (2.38) describing a lattice of atoms by the 
tight-binding Hamiltonian (2.34). 


A Linear Combination of Atomic Orbitals 


What should we do if the band of interest does overlap with bands from more deeply 
bound states? The answer is that we should go back to our original ansatz (2.39) and 
replace it with something more general, namely 


ae ee >, CnOn(x — r) (2.42) 


Ui 


where this time we sum over all localised states of interest, @,(x) with energies €,. 
These are now weighted with coefficients c,, which we will determine shortly. This kind 
of ansatz is known as a linear combination of atomic orbitals. Among people who play 
these kind of games, it is common enough to have its own acronym (LCAO obviously). 


The wavefunction (2.42) should be viewed as a variational ansatz for the eigenstates, 
where we get to vary the parameters c,. The expected energy is again 


— bul lan) 
G Are 


where, repeating the calculations that we just saw, we have 


(uly) = DDD enter wy de g(x — 2) bn(x) 


reA n,n’ 


= => > iCne an,n (r) (2.43) 


reA n,n’ 
and 


(alba) = Sod een en | da by (oe — 2)(Haom + AV) bn (X) 


reA n,n’ 


=> rene? (enomp (r) + Mawl) (2.44) 


rEA n,n! 


Note that we’ve used slightly different notation from before. We haven’t isolated the 
piece Ann (r = 0) = dn», nor the analogous Ae piece corresponding to Yn, (£ = 0). 
Instead, we continue to sum over all lattice points r € A, including the origin. 
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The variational principle says that we should minimise the expected energy over all 
Cn. This means we should solve 
OE(k) 1 


ð 
Bet, ~ Wal) dor, PH Mae) — 


(Pil Hlp) 9 
(Pilt)? OCh, 


o o 
ga, Vxl lv) = E(k) ga dhi) =0 


(VklYk) = 0 


Using our expressions (2.43) and (2.44), we can write the resulting expression as the 
matrix equation 


X Mani (K)Cn = 0 (2.45) 
where M,, ,/(k) is the Hermitian matrix 


Mn (k) = XO e-** (F(t) — (E(k) = enan (r) 
reA 
The requirement (2.45) that Mnn/(k) has a zero eigenvalue can be equivalently written 
as 


det Man (k) = 0 


Let’s think about how to view this equation. The matrix Mn,w(k) is a function of 
the various parameters which encode the underlying lattice dynamics as well as E(k). 
But what we want to figure out is the dispersion relation E(k). We should view the 
condition det Mn /(k) = 0 as an equation for E(k). 


Suppose that we include p localised states at each site, so My w (k) is a p x p matrix. 
Then det Mn, (k) = 0 is a polynomial in E(k) of degree p. This polynomial will have 
p roots; these are the energies E,(k) of p bands. In each case, the corresponding null 
eigenvector is cn which tells us how the atomic orbitals mix in the Bloch state (2.42). 


2.4 Scattering Off a Lattice 


Finally, we come to an important question: how do we know that solids are made of 
lattices? The answer, of course, is scattering. Firing a beam of particles — whether 
neutrons, electrons or photons in the X-ray spectrum — at the solid reveals a char- 
acteristic diffraction pattern. Our goal here is to understand this within the general 
context of scattering theory. 


In what follows, we will assume a knowledge of the basics of scattering theory. Full 
details can be found in the lectures on Topics in Quantum Mechanics. 
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Our starting point is the standard asymptotic expression describing a wave scattering 
off a central potential, localised around the origin, 


ikr 


Yle) ~ e™® + f(k; k’) — 


(2.46) 
Here we’re using the notation, introduced in earlier sections, of the scattered momentum 
k = kf 


The idea here is that if you sit far away in the direction f, you will effectively see a wave 
with momentum k’. We therefore write f(k, k’) to mean the same thing as f(k; 0, 9). 


Suppose now that the wave scatters off a potential which is localised at some other 
position, r = R. Then the equation (2.46) becomes 


etklr-R| 


w(r) ~ ci (r-R) + f(k, KR 


For r — oo, we can expand 


lr -—R) =Vr?4+ R--RerVJ/l-2r-R/rsr-t-R 


We then have 


ikr 
poa e e a (2.47) 
r 


The overall factor is unimportant, since our interest lies in the phase shift between 
the incident wave and the scattered wave. We see that we get an effective scattering 
amplitude 


falk; f) = f(k, kJet" 
where we have defined the transferred momentum 
q=k-k 


Now let’s turn to a lattice of points A. Ignoring multiple scatterings, the amplitude is 
simply the sum of the amplitudes from each lattice point 


fa(k,k’) = f(k,k’) $ ean (2.48) 


REA 


S= 


However, we already discussed the sum A(q) = J rea TP in Section 2.2.2. The sum 
has the nice property that it vanishes unless q lies in the reciprocal lattice A*. This 
is simple to see: since we have an infinite lattice it must be true that, for any vector 
Ro € A, 
Alq) = > eaR _ > cia (R-Ro) _ eano A (q) 
REA REA 

This means that either e~*#®° = 1 or A(q) = 0. The former result is equivalent to the 
statement that q € A*. More generally, 


X eR = A(q) = V* X ôa- Q) (2.49) 
REA QEA* 
where V* is the volume of the unit cell of A*. We see that A(q) is very strongly 
(formally, infinitely) peaked on the reciprocal lattice. 


The upshot of this discussion is a lovely result: there is scattering from a lattice if 
and only if 


k= eA’ (2.50) 


This is known as the Laue condition. If the scattered momentum does not satisfy 
this condition, then the interference between all the different scattering sites results 
in a vanishing wave. Only when the Laue condition is obeyed is this interference 
constructive. 


Alternatively, the Laue condition can be viewed as momentum conservation, with 
the intuition — garnered from Section 2 — that the lattice can only absorb momentum 
in AŽ. 


Solutions to the Laue condition are not generic. If you take a lattice with a fixed 
orientation and fire a beam with fixed k, chances are that there are no solutions to 
(2.50). To see this, consider the reciprocal lattice as shown in the left-hand panel of 
the figure. From the tip of k draw a sphere of radius k. This is sometimes known as 
the Ewald sphere and its surface gives the possible transferred momenta q = k — K. 
There is scattering only if this surface passes through a point on the reciprocal lattice. 


To get scattering, we must therefore either find a wave to vary the incoming momen- 
tum k, or find a way to vary the orientation of the lattice. But when this is achieved, 
the outgoing photons k’ = kf sit only at very specific positions. In this way, we get to 
literally take a photograph of the reciprocal lattice! The resulting diffraction pattern 
for salt (NaCl) which has a cubic lattice structure is shown in the right-hand panel. 
The four-fold symmetry of the reciprocal lattice is clearly visible. 
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Figure 35: The Ewald sphere, drawn in Figure 36: Salt. 
the reciprocal lattice. 


2.4.1 The Bragg Condition 


There is an equivalent phrasing of the Laue condition in real space. Suppose that the 
momentum vectors obey 


k-k’=QeA* 


Since Q is a lattice vector, so too is nQ for all n € Z. Suppose that Q is minimal, so 
that nQ is not a lattice a vector for any n < 1. Defining the angle 0 by k-k’ = k? cos 0, 
we can take the square of the equation above to get 


2k?(1 — cos 0) = 4k’ sin?(0/2)=Q? => 2ksin(@/2)=Q 


We can massage this further. The vector Q € A* defines a set of parallel planes in A. 
Known as Bragg planes, these are labelled by an integer n and defined by those a € A 
which obey a- Q = 27n. The distance between successive planes is 


_ 20 


Q 


Furthermore, the wavevector k corresponds to a wavelength A = 2a/k. We learn that 


d 


the Laue condition written as the requirement that 


A = 2dsin(0/2) 
Repeating this argument for vectors nQ with n € Z, we get 


nà = 2dsin(6/2) 
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Figure 38: A quasi-crystal. Figure 39: DNA, Photograph 51. 


This is the Bragg condition. It has a simple interpreta- 
tion. For n = 1, we assume that the wave scatters off two 
consecutive planes of the lattice, as shown figure. The 


wave which hits the lower plane travels an extra distance roy 
of 2x = 2dsin(6/2). The Bragg condition requires this | 4 < a 
extra distance to coincide with the wavelength of light. In e2 aoa 


other words, it is the statement that waves reflecting off Figure 37: 
consecutive planes interfere constructively. 


The Bragg condition gives us licence to think about scattering of light off planes in 
the lattice, rather than individual lattice sites. Moreover, it tells us that the wavelength 
of light should be comparable to the atomic separation in the crystal. This means x- 
rays. The technique of x-ray crystallography was pioneered by Max von Laue, who 
won the 1914 Nobel prize. The Bragg law was developed by William Bragg, a fellow of 
Trinity and director of the Cavendish. He shared the 1915 Nobel prize in physics with 
his father, also William Bragg, for their development of crystallographic techniques. 


X-ray crystallography remains the most important technique to determine the struc- 
ture of materials. Two examples of historical interest are shown in the figures. The 
picture on the left is something of an enigma since it has five-fold symmetry. Yet 
there are no Bravais lattices with this symmetry! The diffraction pictures is revealing 
a quasi-crystal, an ordered but non-periodic crystal. The image on the right was taken 
by Rosalind Franklin and is known as “photograph 51”. It provided a major, and 
somewhat controversial, hint to Crick and Watson in their discovery of the structure 
of DNA. 


2.4.2 The Structure Factor 


Many crystals are described by a repeating ground of atoms, which each group sits on 
an underlying Bravais lattice A. The atoms in the group are displaced from the vertex 
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of the Bravais lattice by a vector d;. We saw several examples of this in Section 2. In 
such a situation, the scattering amplitude (2.48) is replaced by 


flattice(k, k’) = A(q) S(q) 


where 
S(q) =~ filk,k’)et 


We have allowed for the possibility that each atom in the basis has a different scattering 
amplitude f;(k,k’). The function S(q) is called the geometric structure factor. 


An Example: BCC Lattice 


As an example, consider the BCC lattice viewed as a simple cubic lattice of size a, 
with two basis vectors sitting at dı = 0 and dg = Sls 1,1). If we take the atoms on 
the points dı and də to be identical, then the associated scattering amplitudes are also 
equal: fi = f= f. 

We know that the scattering amplitude is non-vanishing only if the transferred mo- 


mentum q lies on the reciprocal lattice, meaning 


2m 
q= — (ny, n2, N3) mE Z 
a 


This then gives the structure factor 
S(q) = f (et + et) 
, 2 N; even 
= f(l+e™n) = > 
0 Š nodd 

We see that not all points in the reciprocal lattice A* contribute. If we draw the 
reciprocal, simple cubic lattice and delete the odd points, as shown in the right-hand 
figure, we find ourselves left with a FCC lattice. (Admittedly, the perspective in the 
figure isn’t great.) But this is exactly what we expect since it is the reciprocal of the 
BCC lattice. 
Another Example: Diamond 


A diamond lattice consists of two, interlaced FCC lattices with basis vectors dı = 0 
and də = $(1,1,1). An FCC lattice has reciprocal lattice vectors bı = 27 (1, 1,1), 


DE 


Figure 40: A BCC lattice as cubic lattice Figure 41: The reciprocal as a cubic lat- 
+ basis. tice minus a basis. 


bs = 2m (1, —1,1) and b; = 27 (1, 1,—1). For q = $; n;b;, the structure factor is 


2 Yon; = 0 mod 4 
1+% S$>n;=1 mod 4 
0 >> nj = 2 mod 4 
1-i Yon; =3 mod 4 


2.4.3 The Debye-Waller Factor 
So far, we’ve treated the lattice as a fixed, unmoving object. But we know from our 


discussion in Section 4 that this is not realistic. The underlying atoms can move. We 
would like to know what effect this has on the scattering off a lattice. 


Let’s return to our result (2.48) for the scattering amplitude off a Bravais lattice A, 
fa(k, k’) = f(k, k’) D eid Rn 


where f(k,k’) is the amplitude for scattering from each site, q = k — k’, and Rẹ € A. 
Since the atoms can move, the position R, are no longer fixed. We should replace 


Rn Ry + u(t) 


where, as in Section 4, u, describes the deviation of the lattice from equilibrium. In 
general, this deviation could arise from either thermal effects or quantum effects. In 
keeping with the theme of these lectures, we will restrict to the latter. But this is 
conceptually interesting: it means that the scattering amplitude includes the factor 


A(q) = S etRn eidun 


which is now a quantum operator. This is telling us something important. When a 
particle — whether photon or neutron — scatters off the lattice, it can now excite a 
phonon mode. The scattering amplitude is a quantum operator because it includes all 
possible end-states of the lattice. 
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This opens up a whole slew of new physics. We could, for example, now start to 
compute inelastic scattering, in which the particle deposits some energy in the lattice. 
Here, however, we will content ourselves with elastic scattering, which means that the 
the lattice sits in its ground state |0) both before and after the scattering. For this, we 
need to compute 


Aa) = JE e'® (ole! 0) 


To proceed, we need the results of Section 4.1.4 in which we treated lattice vibrations 
quantum mechanically. For simplicity, let’s consider a simple cubic lattice so that the 
the matrix element above factorises into terms in the x, y and z direction. For each of 
these, we can use the formalism that we developed for the one-dimensional lattice. 


The matrix element (O|e’""|0) is independent of time and is also translationally 
invariant. This means that we can evaluate it at t = 0 and at the lattice site n = 0. 
For a one-dimensional lattice with N sites, the expansion (4.11) gives 


w=) NI (a(k) + at (k)) =A+A' 
kA0 


Here we’ve used the rescaling (4.14) so that the creation and annihilation operators 
obey the usual commutation relations [a(k), a'(k’)] = 64,47. The operators a' (k) create 
a phonon with momentum k and energy w(k). The operators A and A‘ then obey 


h 
Aas 3 2mNw(k) 


Our goal now is to compute (O|e(4+4|0). For this we use the BCH formula, 


i igAt igA 14q2[At 
et A+A = e194 et^ e31 [At,A] 


But the ground state of the lattice is defined to obey a;|0) = 0 for all /. This means 
that e*14|0} = |0). We end up with the result 


i w hq? 
1q:uo = (a) — ——— 
(Olet) =e "SY where W(q) J mea) 


This is called the Debye-Waller factor. We see that the scattering amplitude becomes 
falk, K’) = eV f(k,k’)A(q) 


Note that, perhaps surprisingly, the atomic vibrations do not broaden the Bragg peaks 
away from q € A*. Instead, they only diminish their intensity. 
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3. Electron Dynamics in Solids 


In the previous chapter we have seen how the single-electron energy states form a band 
structure in the presence of a lattice. Our goal now is to understand the consequences 
of this, so that we can start to get a feel for some of the basic properties of materials. 


There is one feature in particular that will be important: materials don’t just have 
one electron sitting in them. They have lots. A large part of condensed matter physics 
is concerned with in understanding the collective behaviour of this swarm of electrons. 
This can often involve the interactions between electrons giving rise to subtle and 
surprising effects. However, for our initial foray into this problem, we will make a fairly 
brutal simplification: we will ignore the interactions between electrons. Ultimately, 
much of the basic physics that we describe below is unchanged if we turn on interactions, 
although the reason for this turns out to be rather deep. 


3.1 Fermi Surfaces 


Even in the absence of any interactions, electrons still are still affected by the presence 
of others. This is because electrons are fermions, and so subject to the Pauli exclusion 
principle. This is the statement that only one electron can sit in any given state. As 
we will see below, the Pauli exclusion principle, coupled with the general features of 
band structure, goes some way towards explaining the main properties of materials. 


Free Electrons 


As a simple example, suppose that we have no lattice. We k 
take a cubic box, with sides of length L, and throw in some 7 


large number of electrons. What is the lowest energy state of 


this system? Free electrons sit in eigenstates with momentum N IN 
hk and energy E = h?k?/2m. Because we have a system of j Ny 
finite size, momenta are quantised as k; = 2mn;/L. Further, = 


they also carry one of two spin states, |f} or |). 


The first electron can sit in the state k = 0 with, say, spin 
|t). The second electron can also have k = 0, but must have Figure 42: The Fermi 
spin ||), opposite to the first. Neither of these electrons costs surface 
any energy. However, the next electron is not so lucky. The 
minimum energy state it can sit in has n; = (1,0,0). Including spin and momentum 
there are a total of six electrons which can carry momentum |k| = 27/L. As we go on, 
we fill out a ball in momentum space. This ball is called the Fermi sea and the boundary 
of the ball is called the Fermi surface. The states on the Fermi surface are said to have 
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Figure 43: Fermi surfaces for valence Z = 1 with increasing lattice strength. 


Fermi momentum hkp and Fermi energy Ep = h?k?./2m. Various properties of the 
free Fermi sea are explored in the lectures on Statistical Physics. 


3.1.1 Metals vs Insulators 


Here we would like to understand what becomes of the Fermi sea and, more importantly, 
the Fermi surface in the presence of a lattice. Let’s recapitulate some important facts 
that we’ll need to proceed: 


e A lattice causes the energy spectrum to splits into bands. We saw in Section 
2.3.2 that a Bravais lattice with N sites results in each band having N momen- 
tum states. These are either labelled by momenta in the first Brillouin zone (in 
the reduced zone scheme) or by momentum in successive Brillouin zones (in the 
extended zone scheme). 


e Because each electron carries one of two spin states, each band can accommodate 
2N electrons. 


e Each atom of the lattice provides an integer number of electrons, Z, which are 
free to roam the material. These are called valence electrons and the atom is said 
to have valence Z. 


From this, we can piece the rest of the story together. We’ll discuss the situation for 
two-dimensional square lattices because it’s simple to draw the Brillouin zones. But 
everything we say carries over for more complicated lattices in three-dimensions. 


Suppose that our atoms have valence Z = 1. There are then N electrons, which 
can be comfortably housed inside the first Brillouin zone. In the left-hand of Figure 
43 we have drawn the Fermi surface for free electrons inside the first Brillouin zone. 
However, we know that the effect of the lattice is to reduce the energy at the edges 
of the Brillouin zone. We expect, therefore, that the Fermi surface — which is the 
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Figure 44: Lithium. Figure 45: Copper. 


equipotential Ey — will be distorted as shown in the middle figure, with states closer 
to the edge of the Brillouin zone filled preferentially. Note that the area inside the 
Fermi surface remains the same. 


If the effects of the lattice get very strong, it may be that the Fermi surface touches 
the edge of the Brillouin zone as shown in the right-hand drawing in Figure 43. Because 
the Brillouin zone is a torus, if the Fermi surface is to be smooth then it must hit the 
edge of the Brillouin zone at right-angles. 


This same physics can be seen in real Fermi surfaces. Lithium has valence Z = 1. 
It forms a BCC lattice, and so the Brillouin zone is FCC. Its Fermi surface is shown 
above, plotted within its Brillouin zone?. Copper also has valency Z = 1, with a FCC 
lattice and hence BCC Brillouin zone. Here the effects of the lattice are somewhat 
stronger, and the Fermi surface touches the Brillouin zone. 


In all of these cases, there are unoccupied states with arbitrarily small energy above 
Ep. (Strictly speaking, this statement holds only in the limit L — oo of an infinitely 
large lattice.) This means that if we perturb the system in any way, the electrons will 
easily be able to respond. Note, however, that only those electrons close to the Fermi 
surface can respond; those that lie deep within the Fermi sea are locked there by the 
Pauli exclusion principle and require much larger amounts of energy if they wish to 
escape. 


This is an important point, so lll say it again. In most situations, only those electrons 
which lie on the Fermi surface can actually do anything. This is why Fermi surfaces 
play such a crucial role in our understanding of materials. 


?This, and other pictures of Fermi surfaces, are taken from http://www.phys.ufl.edu/fermisurface/. 
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Figure 46: Fermi surfaces for valence Z = 2 with increasing lattice strength, moving from a 
metal to an insulator. 


Materials with a Fermi surface are called metals. Suppose, for example, that we 
apply a small electric field to the sample. The electrons that lie at the Fermi surface 
can move to different available states in order to minimize their energy in the presence 
of the electric field. This results in a current that flows, the key characteristic of a 
metal. We’ll discuss more about how electrons in lattices respond to outside influences 
in Section 3.2 


Before we move on, a couple of comments: 


e The Fermi energy of metals is huge, corresponding to a temperature of Er/kp ~ 
10* K, much higher than the melting temperature. For this reason, the zero 
temperature analysis is a good starting point for thinking about real materials. 


e Metals have a very large number of low-energy excitations, proportional to the 
area of the Fermi surface. This makes metals a particularly interesting theoretical 
challenge. 


Let’s now consider atoms with valency Z = 2. These je 
have 2N mobile electrons, exactly the right number to fill 
the first band. However, in the free electron picture, this 
is not what happens. Instead, they partially fill the first 
Brillouin zone and then spill over into the second Brillouin 
zone. The resulting Fermi surface, drawn in the extended 
zone scheme, is shown in left-hand picture of Figure 46 


If the effects of the lattice are weak, this will not be 
greatly changed. Both the first and second Brillouin zones Figure 47: Beryllium 
will have available states close to the Fermi surface as 
shown in the middle picture. These materials remain metals. We sometimes talk 
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Ist zone 3rd zone 
e d 
2nd zone 3rd zone re—drawn 


Figure 48: Fermi surfaces for valence Z = 3. 


of electrons in the second band, and holes (i.e. absence of electrons) in the first band. 
We will discuss this further in Section 3.2. Beryllium provides an example of a metal 
with Z = 2; its Fermi surface is shown in the figure, now plotted in the reduced zone 
scheme. It includes both an electron Fermi surface (the cigar-like shapes around the 
edge) and a hole Fermi surface (the crown in the middle). 


Finally, if the effects of the lattice become very strong, the gap between the two 
bands is large enough to overcome the original difference in kinetic energies. This 
occurs when the lowest lying state in the second band is higher than the highest state 
in the first. Now the electrons fill the first band. The second band is empty. The Fermi 
sea looks like the right-hand picture in Figure 46. This is qualitatively different from 
previous situations. There is no Fermi surface and, correspondingly, no low-energy 
excitations. Any electron that wishes to change its state can only do so by jumping 
to the next band. But that costs a finite amount of energy, equal to the gap between 
bands. This means that all the electrons are now locked in place and cannot respond 
to arbitrarily small outside influences. We call such materials insulators. (Sometimes 
they are referred to as band insulators to highlight the fact that it is the band structure 
which prevents the electrons from moving.) 


This basic characterisation remains for higher valency Z. Systems with partially 
filled bands are metals; systems with only fully-filled bands are insulators. Note that a 
metal may well have several fully-filled bands, before we get to a partially filled band. 
In such circumstances, we usually differentiate between the fully-filled lower bands — 
which are called valence bands — and the partially filled conduction band. 
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The Fermi surfaces may exist in several different bands. An example of a Fermi 
surface for Z = 3 is shown in Figure 48, the first three Brillouin zones are shown 
separately in the reduced zone scheme. At first glance, it appears that the Fermi 
surface in the 3° Brillouin zone is disconnected. However, we have to remember that 
the edges of the Brillouin zone are identified. Re-drawn, with the origin taken to be 
k = (m/a,7/a), we see the Fermi surface is connected, taking the rosette shape shown. 


Looking Forwards 


We have seen how band structure allows us to classify all materials as metals or in- 
sulators. This, however, is just the beginning, the first chapter in a long and detailed 
story which extends from physics into materials science. To whet the appetite, here 
are three twists that we can add to this basic classification. 


e For insulators, the energy required to reach the first excited state is set by the 
band gap A which, in turn, is determined by microscopic considerations. Materi- 
als whose band gap is smaller than A < 2 eV or so behave as insulators at small 
temperature, but starts to conduct at higher temperatures as electrons are ther- 
mally excited from the valence band to the conduction band. Such materials are 
called semiconductors. They have the property that their conductivity increases 
as the temperature increases. (This is in contrast to metals whose conductivity 
decreases as temperature increases.) John Bardeen, Walter Brattain and William 
Shockley won the 1956 Nobel prize for developing their understanding of semi- 
conductors into a working transistor. This, then, changed the world. 


e There are some materials which have Z = 1 but are, nonetheless, insulators. An 
example is nickel oxide NiO. This contradicts our predictions using elementary 
band structure. The reason is that, for these materials, we cannot ignore the 
interactions between electrons. Roughly speaking, the repulsive force dominates 
the physics and effectively prohibits two electrons from sitting on the same site, 
even if they have different spins. But with only one spin state allowed per site, 
each band houses only N electrons. Materials with this property are referred to 
as Mott insulators. Nevill Mott, Cavendish professor and master of Caius, won 
the 1977 Nobel prize, in part for this discovery. 


e For a long time band insulators were considered boring. The gap to the first 
excited state means that they can’t do anything when prodded gently. This 
attitude changed relatively recently when it was realised that you can be boring 
in different ways. There is a topological classification of how the phase of the 
quantum states winds as you move around the Brillouin zone. Materials in which 
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this winding is non-trivial are called topological insulators. They have wonderful 
and surprising properties, most notably on their edges where they come alive with 
interesting and novel physics. David Thouless and Duncan Haldane won the 2016 
Nobel prize for their early, pioneering work on this topic. 


More generally, there is a lesson above that holds in a much wider context. Our 
classification of materials into metals and insulators hinges on whether or not we can 
excite a multi-electron system with an arbitrarily small cost in energy. For insulators, 
this is not possible: we require a finite injection of energy to reach the excited states. 
Such systems are referred to as gapped, meaning that there is finite energy gap between 
the ground state and first excited state. Meanwhile, systems like metals are called 
gapless. Deciding whether any given quantum system is gapped or gapless is one of the 
most basic questions we can ask. It can also be one of the hardest. For example, the 
question of whether a quantum system known as Yang-Mills theory has a gap is one of 
the six unsolved milllenium maths problems. 


3.1.2 The Discovery of Band Structure 


Much of the basic theory of band structure was laid down by Felix Bloch in 1928 as 
part of his doctoral thesis. As we have seen, Bloch’s name is attached to large swathes 
of the subject. He had an extremely successful career, winning the Nobel prize in 
1952, working as the first director-general of CERN, and building the fledgling physics 
department at Stanford University. 


However, Bloch missed the key insight that band structure explains the difference 
between metals and insulators. This was made by Alan Wilson, a name less well known 
to physicists. Wilson was a student of Ralph Fowler in Cambridge. In 1931, he took 
up a research position with Heisenberg and it was here that he made his important 
breakthrough. He returned on a visit to Cambridge to spread the joy of his newfound 
discovery, only to find that no one very much cared. At the time, Cambridge was in 
the thrall of Rutherford and his motto: “There are two kinds of science, physics and 
stamp collecting”. And when Rutherford said “physics”, he meant “nuclear physics”. 


This, from Nevill Mott, 


“I first heard of [Wilson’s discovery] when Fowler was explaining it to 
Charles Ellis, one of Rutherford’s closest collaborators, who said ‘very in- 
teresting’ in a tone which implied that he was not interested at all. Neither 
was I.” 
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Figure 49: The honeycomb lattice. Figure 50: And its basis vectors. 


Nevill Mott went on to win the Nobel prize for generalising Wilson’s ideas. Wilson 
himself didn’t do so badly either. He left academia and moved to industry, rising to 
become chairman of Glaxo. 


3.1.3 Graphene 


Graphene is a two-dimensional lattice of carbon atoms, arranged in a honeycomb struc- 
ture as shown in the figure. Although it is straightforward to build many layers of these 
lattices — a substance known as graphite — it was long thought that a purely two- 
dimensional lattice would be unstable to thermal fluctuations and impossible to create. 
This changed in 2004 when Andre Geim and Konstantin Novoselov at the University 
of Manchester succeeded in isolating two-dimensional graphene. For this, they won 
the 2010 Nobel prize. As we now show, the band structure of graphene is particularly 
interesting. 


First, some basic lattice facts. We described the honeycomb lattice in Section 2.2.1. 
It is not Bravais. Instead, it is best thought of as two triangular sublattices. We define 
the primitive lattice vectors 

3 3a 
a, = CR 1) and a= C = 
where a the distance between neighbouring atoms, which in graphene is about a ~ 
1.4 x 107! m. These lattice vectors are shown in the figure. 


Sublattice A is defined as all the points r = nya, + nga with n; E€ Z. These are the 
red dots in the figure. Sublattice B is defined as all points r = nya, + nga + d with 
d = (—a,0). These are the white dots. 


The reciprocal lattice is generated by vectors bj satisfying a;-b; = 270;;. These are 


(1,v3) and by = "(1,-v8) 


_ 20 


b 
1 3a 
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This reciprocal lattice is also triangular, rotated 90° from the orig- 
inal. The Brillouin zone is constructed in the usual manner by 
drawing perpendicular boundaries between the origin and each 


other point in the reciprocal lattice. This is shown in the figure. 
We shortly see that the corners of the Brillouin zone carry par- 


ticular interest. It naively appears that there are 6 corners, but 
this should really be viewed as two sets of three. This follows be- 
cause any points in the Brillouin zone which are connected by a Figure 51: 
reciprocal lattice vector are identified. Representatives of the two, 

inequivalent corners of the Brillouin zone are given by 


1 QT 1 1 2m 1 
K = -(2b bə) = — | 1, — K’ = -(b 2b») = — | 1, —— Ll 
| 1 + be) 3a ( =) and | 1 + 2b2) 3a (1 ) (3.1) 


These are shown in the figure above. 


Tight Binding for Graphene 


The carbon atoms in graphene have valency Z = 1, with the p,-atomic orbital aban- 
doned by their parent ions and free to roam the lattice. In this context, it is usually 
called the z-orbital. We therefore write down a tight-binding model in which this elec- 
tron can hop from one atomic site to another. We will work only with nearest neighbour 
interactions which, for the honeycomb lattice, means that the Hamiltonian admits hop- 
ping from a site of the A-lattice to the three nearest neighbours on the B-lattice, and 
vice versa. The Hamiltonian is given by 


H = => Ie A) (r; B| + |r; A)r + ay; B| + |r; A) (r + ag; B| + hic. (3.2) 


recA 
where we’re using the notation 
|r; A) =|r) and |r;B)=|r+d) with d= (—a,0) 


Comparing to (2.34), we have set Eo = 0, on the grounds that it doesn’t change any of 
the physics. For what it’s worth, t ~ 2.8 eV in graphene, although we won’t need the 
precise value to get at the key physics. 


The energy eigenstates are again plane waves, but now with a suitable mixture of A 
and B sublattices. We make the ansatz 


1 ik-r z s 
WO) = Bae (cali; A) + enlr; B)) 
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Plugging this into the Schrodinger equation, we find that c4 and cg must satisfy the 


0 (k) CA CA 
= E(k 3.3 
Pe o (e) (8) = 


v(k) = -t(1 + e” + eee) 


eigenvalue equation 


where 


The energy eigenvalues of (3.3) are simply 
E(k) = £|7(k)| 


We can write this as 
2 


. ; 2 
E(k)’ = gi sa ek ar + eik-ar = t? 


1 + 2e8*"20/2 cog (5) 


Expanding this out, we get the energy eigenvalues 


E(k) = =t, | 1 + 4 cos (=) cos (4 “) + 4 cos? (4 r) 


Note that the energy spectrum is a double cover of the first Brillouin zone, symmetric 
about Æ = 0. This doubling can be traced to the fact the the honeycomb lattice 
consists of two intertwined Bravais lattices. Because the carbon atoms have valency 
Z = 1, only the lower band with E(k) < 0 will be filled. 


The surprise of graphene is that these two bands meet at special points. These occur 
on the corners k = K and k = K’ (3.1), where cos(3k,a/2) = —1 and cos(v’3k„a/2) = 
1/2. The resulting band structure is shown in Figure 523. Because the lower band is 
filled, the Fermi surface in graphene consists of just two points, K and K’ where the 
bands meet. It is an example of a semi-metal. 


Emergent Relativistic Physics 


The points k = K and K’ where the bands meet are known as Dirac points. To see 
why, we linearise about these points. Write 


k=K+q 


3The image is taken from the exciting-code website. 
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Figure 52: The band structure of graphene. 


A little Taylor expansion shows that in the vicinity of the Dirac points, the dispersion 
relation is linear 


E(k) x += al 


But this is the same kind of energy-momentum relation that we meet in relativistic 
physics for massless particles! In that case, we have Æ = |p|c where p is the momentum 
and c is the speed of light. For graphene, we have 


E(k) ~ hup|q| 


where fq is the momentum measured with respect to the Dirac point and vp = 3ta/2h is 
the speed at which the excitations propagate. In graphene, vp is about 300 times smaller 
than the speed of light. Nonetheless, it remains true that the low-energy excitations 
of graphene are governed by the same equations that we meet in relativistic quantum 
field theory. This was part of the reason for the excitement about graphene: we get to 
test ideas from quantum field theory in a simple desktop experiment. 


We can tease out more of the relativistic structure by returning to the Hamiltonian 
(3.2). Close to the Dirac point k = K we have 


v(k) = -t ( — 2¢3444/2 cog € + a) 


ARa] 
2 2 2 2 
(1-2145...) (5- we...) 


R 
$ 
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This means that the Hamiltonian in the vicinity of the Dirac point k = K takes the 
form 


H = vph í , A an ") = —vph(qro” + qyo") (3.4) 
Wa — dy 0 

where o” and o¥ are the Pauli matrices. But this is the Dirac equation for a massless 

particle moving in two-dimensions, sometimes referred to as the Pauli equation. (Note: 

our original choice of orientation of the honeycomb lattice has resulted in a slightly 

annoying expression for the Hamiltonian. Had we rotated by 90° to begin with, we 

would be left with the nicer H = ħvp q: ø where ø = (a”,a”").) 


There’s something of an irony here. In the original Dirac equation, the 2 x 2 matrix 
structure comes about because the electron carries spin. But that’s not the origin of the 
matrix structure in (3.4). Indeed, we’ve not mentioned spin anywhere in our discussion. 
Instead, in graphene the emergent “spin” degree of freedom arises from the existence 
of the two A and B sublattices. 


We get a very similar equation in the vicinity of the other Dirac point. Expanding 
k = K’ + q’, we get the resulting Hamiltonian 


H = —vrħ(qao” = quo”) 


The difference in minus sign is sometimes said to be a different handedness or helicity. 
You will learn more about this in the context of high energy physics in the lectures on 
Quantum Field Theory. 


As we mentioned above, we have not yet included the spin of the electron. This is 
trivial: the discussion above is simply repeated twice, once for spin |) and once for 
spin ||). The upshot is that the low-energy excitations of graphene are described by 
four massless Dirac fermions. One pair comes from the spin degeneracy of the electrons; 
the other from the existence of two Dirac points K and K’, sometimes referred to as 
the valley degeneracy. 


3.2 Dynamics of Bloch Electrons 


In this section, we look more closely at how electrons moving in a lattice environment 
react to external forces. We call these electrons Bloch electrons. We start by describing 
how some familiar quantities are redefined for Bloch electrons. 
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For simplicity, consider an insulator and throw in one further electron. This solitary 
electron sits all alone in an otherwise unoccupied band. The possible states available 
to it have energy E(k) where k lies in the first Brillouin zone. (The energy should also 
have a further discrete index which labels the particular band the electron is sitting in, 
but we’ll suppress this in what follows). Despite its environment, we can still assign 
some standard properties to this electron. 


3.2.1 Velocity 

The average velocity v of the electron is 

_ LOE 
~ ROK 


First note that this is simply the group velocity of a wavepacket (a concept that we’ve 


(3.5) 


Vv 


met previously in the lectures on Electromagnetism). However, the “average velocity” 
means something specific in quantum mechanics, and to prove (3.5) we should directly 
compute v = + (4| — iAV|y). 


Bloch’s theorem ensures that the electron eigenstates take the form 
(x) = e™* ux(x) 


with k in the Brillouin zone. As with the energy, we’ve suppressed the discrete band 
index on the wavefunction. The full wavefunction satisfies Hyy(x) = E(k)vx(x), so 
that u(x) obeys 


h2 
Ayuy(x) = E(k)u}(x) with Ay = T +k)? + V(x) (3.6) 
m 
We'll use a slick trick. Consider the Hamiltonian Hk+q which we expand as 
OH, l 1 0? Hk 
ak 4" Zokiki 4 


For small q, we view this as a perturbation of Hy. From our results of first order 


Aig = Hk } igl (3.7) 


perturbation theory, we know that the shift of the energy eigenvalues is 
OH, 
A = ——. 
(ukl * alte) 


But we also know the exact result: it is simply E(k +q). Expanding this to first order 
in q, we have the result 


OH, OE 


(ukg w = Ək 
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But this is exactly what we need. Using the expression (3.6) for Hy, the left-hand side 


1S 
2 


 (tu|( EV +k) te) = Z (ol — iAV ee) = Av 
This gives our desired result (3.5). 

It is perhaps surprising that eigenstates in a crystal have a fixed, average velocity. 
One might naively expect that the particle would collide with the crystal, bouncing 
all over the place with a corresponding vanishing average velocity. Yet the beauty of 
Bloch’s theorem is that this is not what happens. The electrons can quite happily glide 
through the crystal structure. 


A Filled Band Carries Neither Current nor Heat 


Before we go on, we can use the above result to prove a simple result: a completely 
filled band does not contribute to the current. This is true whether the filled band is 
part of an insulator, or part of a metal. (In the latter case, there will also be a partially 
filled band which will contribute to the current.) 


The current carried by each electron is j = —ev where —e is the electron charge. 
From (3.5), the total current of a filled band is then 
: 2e Ëk ðE 

ETE “ 
ħ Jgz (27)? Ok 


where the overall factor of 2 counts the spin degeneracy. This integral vanishes. This 
follows because E(k) is a periodic function over the Brillouin zone and the total deriva- 
tive of any periodic function always integrates to zero. 


Alternatively, if the crystal has an inversion symmetry then there is a more di- 
rect proof. The energy satisfies E(k) = E(—k), which means that OE(k)/0k = 
—0E(—k)/Ok and the contributions to the integral cancel between the two halves of 
the Brillouin zone. 


The same argument shows that a filled band cannot transport energy in the form of 
heat. The heat current is defined as 


Pk 1 f @k O(E?) 
Oof tk 
je af (np J1 (Qn) Ok 


which again vanishes when integrated over a filled band. This means that the electrons 


trapped in insulators can conduct neither electricity nor heat. Note, however, that 
while there is nothing else charged that can conduct electricity, there are other degrees 
of freedom — in particular, phonons — which can conduct heat. 
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3.2.2 The Effective Mass 


We define the effective mass tensor to be 


E \ 
*x _ £2 
M= (r) 


where we should view the right-hand side as the inverse of a matrix. 


For simplicity, we will mostly consider isotropic systems, for which m}; = m*d;; and 
the effective mass of the electron is given by 


. PE 
m* =R? Ga (3.9) 


where the derivative is now taken in any direction. This definition reduces to something 
very familiar when the electron sits at the bottom of the band, where we can Taylor 
expand to find 


E = Emin —|k — kmin sae 
7 ome | ig 
This is the usual dispersion relation or a non-relativistic particle. 


The effective mass m* has more unusual properties higher up the band. For a typical 
band structure, m* becomes infinite at some point in the middle, and is negative close 
to the top of the band. We’ll see how to interpret this negative effective mass in Section 
3.24. 


In most materials, the effective mass m* near the bottom of the band is somewhere 
between 0.01 and 10 times the actual mass of the electron. But there are exceptions. 
Near the Dirac point, graphene has an infinite effective mass by the definition (3.9), 
although this is more because we’ve used a non-relativistic definition of mass which is 
rather daft when applied to graphene. More pertinently, there are substances known, 
appropriately, as heavy fermion materials where the effective electron mass is around 
a 1000 times heavier than the actual mass. 


A Microscopic View on the Effective Mass 


We can get an explicit expression for the effective mass tensor m,; in terms of the 
microscopic electron states. This follows by continuing the slick trick we used above, 
now thinking about the Hamiltonian (3.7) at second order in perturbation theory. This 
time, we find the inverse mass matrix is given by 


(mz z Og | 1 y (Unk Pil Pn’) (Unk (Pj |Wn'k) — h.c. 


T m m oe En(k) — Ew (k) 
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where n labels the band of each state. Note that the second term takes the familiar 
form that arises in second order perturbation theory. We see that, microscopically, 
the additional contributions to the effective mass come from matrix elements between 
different bands. Nearby bands of a higher energy give a negative contribution to the 
effective mass; nearby bands of a lower energy give a positive contribution. 


3.2.3 Semi-Classical Equation of Motion 


Suppose now that we subject the electron to an external potential force of the form 
F = —VU (x). The correct way to proceed is to add U(x) to the Hamiltonian and 
solve again for the eigenstates. However, in many circumstances, we can work semi- 
classically. For this, we need that U(x) is small enough that it does not distort the 
band structure and, moreover, does not vary greatly over distances comparable to the 
lattice spacing. 


We continue to restrict attention to the electron lying in a single band. To proceed, 
we should think in terms of wavepackets, rather than plane waves. This means that 
the electron has some localised momentum k and some localised position x, within the 
bounds allowed by the Heisenberg uncertainty relation. We then treat this wavepacket 
as if it was a classical particle, where the position x and momentum ñk depend on 
time. This is sometimes referred to as a semi-classical approach. 


The total energy of this semi-classical particle is E(k)+U(x) where E(k) is the band 
energy. The position and momentum evolve such that the total energy is conserved. 


This gives 
d OE dk dx dk 
(Elk ENS -L =v. |h = 
=( (k(t)) + U(x(#))) mk at ma ¢ at vu) 
which is satisfied when 
nE = -VU =F (3.10) 
dt o l 


This should be viewed as a variant of Newton’s equation, now adapted to the lattice 
environment. In fact, we can make it look even more similar to Newton’s equation. For 
an isotropic system, the effective “mass times acceleration” is 

dv m*d (0E m* (dk O\ OE dk 
dt h a (aE) Oh & x) ðk dt 


where you might want to use index notation to convince yourself of the step in the mid- 


m* 


=F (3.11) 


dle where we lost the effective mass m*. It’s rather nice that, despite the complications 
of the lattice, we still get to use some old equations that we know and love. Of course, 
the key to this was really the definition (3.9) of what we mean by effective mass m*. 


— 95 — 


An Example: Bloch Oscillations 


Consider a Bloch electron, exposed to a constant electric field E£. The semi-classical 
equation of motion is 
. eE 
Ak=-eE = k(t)=k(0)— ra 

So the crystal momentum k increases linearly. At first glance, this is unsurprising. But 
it leads to a rather surprising effect. This is because k is really periodic, valued in the 
Brillouin zone. Like a character in a 1980s video game, when the electron leaves one 
edge of the Brillouin zone, it reappears on the other side. 


Fë We can see what this means in terms of velocity. fz 
For a typical one-dimensional band structure shown on t 
the right, the velocity v ~ k in the middle of the band, = 
but v ~ —k as the particle approaches the edge of the 
Brillouin zone. In other words, a constant electric field 


gives rise to an oscillating velocity, and hence an oscillat- aa a a a a 
ing current! This surprising effect is called Bloch oscilla- 


tions. Figure 53: 


As an example, consider a one-dimensional system with a tight-binding form of band 
structure 


E = —C cos(ka) 


Then the velocity in a constant electric field oscillates as 
Ca Ca . (eEa 
v(k) = T sin(ka) = —> sin (£) 


The Bloch frequency is w = e€a/h. If we construct a wavepacket from several different 
energy eigenstates, then the position of the particle will similarly oscillate back and 
forth. This effect was first predicted by Leo Esaki in 1970. 


Bloch oscillations are somewhat counterintuitive. They mean that a DC electric field 
applied to a pure crystal does not lead to a DC current! Yet we’ve all done experiments 
in school where we measure the DC current in a metal! This only arises because a 
metal is not a perfect crystal and the electrons are scattered by impurities or thermal 
lattice vibrations (phonons) which destroy the coherency of Bloch oscillations and lead 


to a current. 
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Bloch oscillations are delicate. The system must 
be extremely clean so that the particle does not 
collide with anything else over the time necessary 
to see the oscillations. This is too much to ask 
in solid state crystals. However, Bloch oscillations 


have been observed in other contexts, such as cold 
atoms in an artificial lattice. The time variation of 
the velocity of Caesium atoms in an optical lattice Figure 54: 
is shown in the figure’. 


3.2.4 Holes 


Consider a totally filled band, and remove one electron. We’re left with a vacancy in 
the otherwise filled band. In a zen-like manoeuvre, we ascribe properties to the absence 
of the particle. Indeed, as we will now see, this vacancy moves as if it were itself an 
independent particle. We call this particle a hole. 


Recall that our definition (3.9) means that the effective mass of electrons is negative 
near the top of the band. Indeed, expanding around the maximum, the dispersion 
relation for electrons reads 


h2 
E(k) = Emax + zz K = kna +... 


and the negative effective mass m* < 0 ensures that electrons have less energy as the 
move away from the maximum. 


Now consider filling all states except one. As the hole moves away from the maximum, 
it costs more energy (because we’re subtracting less energy!). This suggests that we 
should write the energy of the hole as 


h? 
Ehoie(k) = —E(k) = — Emax + ——|k — Kinax|? +... 
2Mirole 
where 
Mole = —m* 


so that the effective mass of the hole is positive near the top of the band, but becomes 
negative if the hole makes it all the way down to the bottom. 


‘This data is taken from “Bloch Oscillations of Atoms in an Optical Potential’ by Dahan et. al., 
Phys. Rev. Lett. vol 76 (1996), which reported the first observation of this effect. 
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The hole has other properties. Suppose that we take away an electron with momen- 
tum k. Then the resulting hole can be thought of as having momentum —k. This 
suggests that we define 


knoe = —k (3.12) 


However, the velocity of the hole is the same as that of the missing electron 
18Eroe 10E 
= v 


“emea o 


This too is intuitive, since the hole is moving in the same direction as the electron that 


we took away. 


The definitions above mean that the hole obeys the Newtonian force law with 


=-F = Fhole (3.13) 


At first sight, this is surprising: the hole experiences an opposite force to the electron. 
But there’s a very simple interpretation. The force that we typically wish to apply to 
our system is an electric field E€ which, for an electron, gives rise to 


F = -e€ 


The minus sign in (3.13) is simply telling us that the hole should be thought of as 
carrying charge +e, the opposite of the electron, 


Ficie = +eE 


We can also reach this same conclusion by computing the current. We saw in (3.8) 
that a fully filled band carries no current. This means that the current carried by a 
partially filled band is 


dk Ëk 
j= -2e | oa v) = 2e f —~ v(k 
filled (27)? l ) unfilled (27)? l ) 


The filled states are electrons carrying charge —e; the unfilled states are holes, carrying 
charge +e. 


Finally, it’s worth mentioning that the idea of holes in band structure provides a fairly 
decent analogy for anti-matter in high-energy physics. There too the electron has a 
positively charged cousin, now called the positron. In both cases, the two particles can 
come together and annihilate. In solids, this releases a few eV of energy, given by the 
gap between bands. In high-energy physics, this releases a million times more energy, 
given by the rest mass of the electron. 
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3.2.5 Drude Model Again 


The essence of Bloch’s theorem is that electrons can travel through perfect crystals 
unimpeded. And yet, in the real world, this does not happen. Even the best metals have 
aresistance, in which any current degrades and ultimately relaxes to zero. This happens 
because metals are not perfect crystals, and the electrons collide with impurities and 
vacancies, as well as thermally vibrations called phonons. 


We can model these effects in our semi-classical description by working with the 
electron equation of motion called the Drude model 


* 


mv = -e£ — “y (3.14) 
T 


Here € is the applied electric field and 7 is the scattering time, which should be thought 
of as the average time between collisions. 


We have already met the Drude model in the lectures on Electromagnetism when 
we tried to describe the conductivity in metals classically. We have now included the 
quantum effects of lattices and the Fermi surface yet, rather remarkably, the equation 
remains essentially unchanged. The only difference is that the effective mass m* will 
depend on k, and hence on v, if the electron is not close to the minimum of the band. 


In equilibrium, the velocity of the electron is 


The proportionality constant is called the mobility, p = |er/m*|. The total current 
density j = —env where n is the density of charge carriers. The equation (3.15) then 


becomes j = cE where ø is the conductivity, 


ern 


T = 


(3.16) 


m* 
We also define the resistivity p = 1/ø. This is the same result that we found in our 
earlier classical analysis, except the mass m is replaced by the effective mass m*. 


There is, however, one crucial difference that the existence of the Fermi surface has 
introduced. When bands are mostly unfilled, it is best to think of the charge carriers 
in terms of negatively charged electrons, with positive effective mass m*. But when 
bands are mostly filled, it is best to think of the charge carriers in terms of positively 
charged holes, also with positive mass my,),. In this case, we should replace the Drude 
model (3.14) with the equivalent version for holes, 


MholeV = teE — Mhole y (3.17) 
T 
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This means that certain materials can appear to have positive charge carriers, even 
though the only things actually moving are electrons. The different sign in the charge 
carrier doesn’t show up in the conductivity (3.16), which depends on e°. To see it, we 
need to throw in an extra ingredient. 


Hall Resistivity 


The standard technique to measure the charge of a material is to apply a magnetic 
field B. Classically, particles of opposite charges will bend in a opposite directions, 
perpendicular to B. In a material, this results in the classical Hall effect. 


We will discuss the motion of Bloch electrons in a magnetic field in much more 
detail in Section 3.3. (And we will discuss the Hall effect in much much more detail 
in other lectures.) Here, we simply want to show how this effect reveals the difference 
between electrons and holes. For electrons, we adapt the Drude model (3.14) by adding 
a Lorentz force, 

mv =—-e(E+v x B)- my 
T 
We once again look for equilibrium solutions with v = 0. Writing j = —nev, we now 
must solve the vector equation 


dts m , 
—jxB+—,j=E 
ne ne?T 
The solution to this is 
E= pj 
where the resistivity p is now a 3 x 3 matrix. If we take B = (0,0, B), then we have 
Pex Pry 0 


P= | —Pzy Pra 0 
0 0 Prs 


where the diagonal, longitudinal resistivity is prx = 1/o where ø is given in (3.16). The 
novelty is the off-diagonal, Hall resistivity 


=B 
We often define the Hall coefficient Ry as 
Pay 1 
R = —— = — 
i B ne 


This, as promised, depends on the charge e. This means that if we were to repeat the 
above analysis for holes (3.17) rather than electrons, we would find a Hall coefficient 
which differs by a minus sign. 
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There are metals — such as beryllium and magnesium — whose Hall coefficient has 
the “wrong sign”. We drew the Fermi surface for beryllium in Section 3.1.1; it contains 
both electrons and holes. In this case, we should add to two contributions with opposite 
signs. It turns out that the holes are the dominant charge carrier. 


3.3 Bloch Electrons in a Magnetic Field 


In this section, we continue our study of Bloch electrons, but now subjected to an 
external magnetic field B. (Note that what we call B should really be called H; it is 
the magnetising field, after taking into account any bound currents.) Magnetic fields 
play a particularly important role in solids because, as we shall see, they allow us to 
map out the Fermi surface. 


3.3.1 Semi-Classical Motion 


We again use our semi-classical equation of motion (3.10) for the electron, now with 
the Lorentz force law 


dk 
where the velocity and momentum are once again related by 
18E 
Seo o 3.19 
Y= iok ee) 


From these two equations, we learn two facts. First, the component of k parallel to B 
is constant: 4(k- B) = 0. Second, the electron traces out a path of constant energy in 
k-space. This is because 

dE OE øk 

fe oe a OP 
These two facts are sufficient for us to draw the orbit in k-space. 


The Fermi surface is, by definition, a surface of constant energy. 
The electrons orbit the surface, perpendicular to B. It’s pictured 
on the right for a spherical Fermi surface, corresponding to free 
electrons. 


Holes have an opposite electric charge, and so traverse the Fermi 
surface in the opposite direction. However, we have to also remem- 
ber that we call kyoic also has a relative minus sign (3.12). As an 
example, consider a metal with Z = 2, which has both electron and 


Figure 55: 


hole Fermi surfaces. In Figure 56, we have drawn the Fermi surfaces of holes (in purple) 
and electrons (in yellow) in the extended zone scheme, and shown their direction of 
propagation in a magnetic field. 
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Holes in the first band Electrons in the second band 


Figure 56: Pockets of electrons and holes for free electrons with Z = 2. 


Orbits in Real Space 


We can also look at the path r(t) that these orbits trace out in real space. Consider 


A 


Ê x hk = -eB x (t x B) = -eBr, (3.20) 


where r, is the position of the electron, projected onto a plane perpendicular to B, 


A A 


ri =r—(B-r)B 
Integrating (3.20), we find 


=o) “8 x (k(t) — k(0)) (3.21) 
In other words, the the particle follows the same shape trajectory 
as in k-space, but rotated about B and scaled by the magnetic 
length 1} = h/eB. For free electrons, with a spherical Fermi sur- 
face, this reproduces the classical result that electrons move in 
circles. However, as the Fermi surface becomes distorted by band 


effects this need no longer be the case, and the orbits in real space 
are no longer circles. For example, the electrons trace out the 
rosette-like shape in the Z = 3 Fermi surface that we saw in Fig- Figure 57: 

ure 48. In extreme cases its possible for the real space orbits to not 

be closed curves at all. This happens, for example, if the Fermi surface is distorted more 
in one direction than another, so it looks like the picture on the right, with electrons 
performing a loop in the Brillouin zone. These are called open Fermi surfaces. 
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3.3.2 Cyclotron Frequency 


Let’s compute the time taken for the electron to complete a closed orbit in k-space. 
The time taken to travel between two points on the orbit kı = k(t,) and kə = k(t.) is 


given by the line integral 
£2 dk 
t2 kg ti = f a 
ki [k| 


We can use (3.20) to relate |k| to the perpendicular velocity, 


3) 


h [Œ dk 
to —t, = 
A 5 adil 


This has a rather nice geometric interpretation. Consider two 


eB 


. eB. 
|k| = = Fe! TH 


so we have 


orbits, both lying in the same plane perpendicular to B, but 
with the second having a slightly higher Fermi energy E +AE. 
To achieve this, the orbit must sit slightly outside the first, with 
momentum 


OE Figure 58: 
=k 2) Atk g 
+ (3), 6 


where, as the notation suggests, A(k), can change as we move around the orbit. We 
require that A(k) is such that the second orbit also has constant energy, 


=- 


The time taken to traverse the orbit can then be written as 


R2 1 k2 
=j a A(k) dk 
t-t aE | hyd 


A(k) 


But this is simply the area of the strip that separates the two orbits; this area, which 
we call A12, is coloured in the figure. In the limit AE — 0, we have 


7 h? Aio 
- eB OE 


t2 — tı 
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We can now apply this formula to compute the time taken to complete a closed orbit. 
Let A(£) denote the area enclosed by the orbit. (Note that this will depend not only 
on E but also on the component of the momentum k- B parallel to the magnetic field.) 
The time taken to complete an orbit is 


h? OA(E) 
T = —~———_ 
eB OE 
The cyclotron frequency is defined as 
20 
yl a5 
we=F (3.22) 


One can check that the cyclotron frequency agrees with the usual result, wg = eB/m 
for free electrons. 


The fact that the cyclotron frequency we depends on some property of the Fermi 
surface — namely 0A/OE — is important because the cyclotron frequency is something 
that can be measured in experiments, since the electrons sit at resonance to absorb 
microwaves tuned to the same frequency. This gives us our first hint as to how we 
might measure properties of the Fermi surface. 


3.3.3 Onsager-Bohr-Sommerfeld Quantisation 


The combination of magnetic fields and Fermi surfaces gives rise to a host of further 
physics but to see this we will have to work a little harder. 


The heart of the problem is that, in classical physics, the Lorentz force does no 
work. In the Hamiltonian formalism, this translates into the statement that the energy 
does not depend on B when written in terms of the canonical momenta. Whenever 
the energetics of a system depend on the magnetic field, there must be some quantum 
mechanics going on underneath. In the present case, this means that we need to go 
slightly beyond the simple semi-classical description that we’ve met above, to find some 
of the discreteness that quantum mechanics introduces into the problem. 


(As an aside: this problem is embodied in the Bohr-van-Leeuwen theorem, which 
states that there can be no classical magnetism. We describe how quantum mechan- 
ics can circumvent this in the discussion of Landau diamagnetism in the lectures on 
Statistical Physics.) 


To proceed, we would ideally like to quantise electrons in the presence of both a lattice 
and a magnetic field. This is hard. We’ve learned how to quantise in the presence of 
a magnetic field in Section 1 and in the presence of lattice in Section 2, but including 
both turns out to be a much more difficult problem. Nonetheless, as we now show, 
there’s a way to cobble together an approximation solution. 
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This cobbled-together quantisation was first proposed by Onsager, but follows an 
earlier pre-quantum quantisation of Bohr and Sommerfield which suggests that, in any 
system, an approximation to the quantisation of energy levels can be found by setting 


= f p:dr =ħ(n+ y) (3.23) 


with n € Z and y an arbitrary constant. This Bohr-Sommerfeld quantisation does not, 
in general, agree with the exact result from solving the Schrödinger equation. However, 
it tends to capture the correct physics for large n, where the system goes over to its 
semi-classical description. 


In the present context, we apply Bohr-Sommerfeld quantisation to our semi-classical 
model (3.18) and (3.19). We have 


1 h h? à 
ee . dr = — -dr = k- (dk x B 
20 peur 27 fx a 27eB f a8) 


where, in the last equality, we have used our result (3.20). But this integral simply 
captures the cross-sectional area of the orbit in k-space. This is the area A(E) that 
we met above. We learn that the Bohr-Sommerfeld quantisation condition (3.23) leads 


to a quantisation of the cross-sectional areas of the Fermi surface in the presence of a 
magnetic field, 


B 27eB 


A 
ý h 


(wy) (3.24) 


This quantisation of area is actually a variant of the Landau level quantisation that we 
met in Section 1.2. There are different ways of seeing this. First, note that, for fixed 
k,, we can write the cyclotron frequency (3.22) as the difference between consecutive 
energy levels 


= 27eB Envi — En Epti = En 
“em TE Aac A i 


Rearranging, this gives 
En = hħwe(n + constant) 


which coincides with our Landau level spectrum (1.14), except that the old cyclotron 
frequency wg = eB/m has been replaced by we. 
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Alternatively, we could look at the quantisation of area in real space, rather than in 
k-space. We saw in (3.21), that the orbit in real space has the same shape as that in 
k-space, but is scaled by a factor of 1} = h/eB. This means that the flux through any 
such orbit is given by 


n, = (2) BA, = (n +7)®o (3.25) 


where ®) = 2rħ/e is the so-called quantum of fluz. But this ties in nicely with our 
discussion in Section 1.2 of Landau levels in the absence of a lattice, where we saw that 
the degeneracy of states in each level is (1.17) 


® 
crs 


which should clearly be an integer. 


The quantisation (3.24) due to a background magnetic field 


results in a re-arrangement of the Fermi surface, which now sit 
in Landau tubes whose areas are quantised. A typical example 
is shown on the right. Figure 59: 


3.3.4 Quantum Oscillations 


The formation of Landau tubes gives rise to a number of fairly striking experimental 
signatures. 


Consider a Fermi surface with energy Ep and a second surface slightly inside with 
energy Ep — dE. The region between these contains the accessible states if we probe 
the system with a small amount of energy dE. Now consider a Landau tube of cross- 
sectional area An, intersecting our Fermi surface. Typically, the Landau tube will 
intersect the Fermi surface only in some small region, as shown in left-hand picture 
of Figure 60. This means that the number of states that can contribute to physical 
processes will be fairly small. In the language that we introduced in the Statistical 
Physics lectures, the density of states g(Er)dE within this Landau tube will be small. 


However, something special happens if the area A, happens to coincide with an 
extremal area of the Fermi surface. Because the Fermi surface curves much more 
slowly at such points, the density of states g(E)dE is greatly enhanced at this point. 
This is shown in the right-hand picture of Figure 60. In fact, one can show that the 


density of states actually diverges at this point as g(E) ~ (E — E)". 
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Figure 60: Landau tubes intersecting the Fermi surface: when the area of the tube coincides 
with an extremal cross-section of the Fermi surface, there is a large enhancement in the 
available states. 


We learn that when the area quantisation takes special values, there are many more 
electrons that can contribute to any physical process. However, the area quantisation 
condition (3.24) changes with the magnetic field. This means that as we increase the 
magnetic field, the areas of Landau tubes will increase and will, occasionally, overlap 
with an extremal area in the Fermi surface. Indeed, if we denote the extremal cross- 
sectional area of the Fermi surface as Aext, we must get an enhancement in the density 
of available states whenever 


B 27eB 
a OR 


(n+ 7) = Aext 


for some n. We don’t know what y is, but this doesn’t matter: the density of states 
should occur over and over again, at intervals given by 


1 27e 1 
a (5) — Aa 


Such oscillations are seen in a wide variety of physical measurements and go by the 


collective name of quantum oscillations. 


The first, and most prominent example of quantum oscillation is the de Haas-van 
Alphen effect, in which the magnetisation M = —OF/OB varies with magnetic field. 
The experimental data for gold is shown in the Figure 61. Note that there are two 
oscillation frequencies visible in the data. The Fermi surface of gold is shown on the 


>The data is taken from I.M.Templeton, Proceedings of the Royal Society A, vol 292 (1965). Note 
the old school graph paper. 
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Figure 61: dHvA oscillations for gold. The horizontal axis is B, plotted in kG. 


right. For the oscillations above, the magnetic field is parallel to the neck of the Fermi 
surface, as shown in the figure. The two frequencies then arise because there are two 
extremal cross-sections — the neck and the belly. As the direction of the magnetic field 
is changed, different extremal cross-sections become relevant. In this way, we can map 
out the entire Fermi surface. 


The magnetisation is not the only quantity to exhibit os- 
cillations. In fact, the large enhancement in the density of states 
affects nearly all observables. For example, oscillations in the 
conductivity are known as the Shub:kov-de Haas effect. 


The experimental technique for measuring Fermi surfaces was 


pioneered by Brian Pippard, Cavendish professor and the first 
president of Clare Hall. Today, the techniques of quantum oscil- 
lations play an important role in attempts to better understand Figure 62: Gold 
some of the more mysterious materials, such as unconventional 

superconductors. 
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4. Phonons 


Until now, we’ve discussed lattices in which the atoms are fixed in place. This is, of 
course, somewhat unrealistic. In materials, atoms can jiggle, oscillating back and forth 
about their equilibrium position. The result of their collective effort is what we call 
sound waves or, at the quantum level, phonons. In this section we explore the physics 
of this jiggling. 


4.1 Lattices in One Dimension 
Much of the interesting physics can be illustrated by sticking to one-dimensional ex- 


amples. 


4.1.1 A Monotonic Chain 


We start with a simple one-dimensional lattice consisting of N equally spaced, identical 
atoms, each of mass m. This is shown below. 


go — lr X 


JIJIA 
We denote the position of each atom as £n, with n = 1,..., N. In equilibrium, the 
atoms sit at 
Tn = Nna 


with a the lattice spacing. 


The potential that holds the atoms in place takes the form }}„V (£n — £n-1). For 
small deviations from equilibrium, a generic potential always looks like a harmonic 
oscillator. The deviation from equilibrium for the n'è atom is given by 


Un (t) = xz,(t) — na 


The Hamiltonian governing the dynamics is then a bunch of coupled harmonic oscilla- 
tors 


H= Pa S - 2 (4.1) 
uam | 2 Lan na 
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where pa = mt, and X is the spring constant. (It is not to be confused with the 
wavelength.) The resulting equations of motion are 


Min = —A(2Un — Un—1 — Un41) (4.2) 


To solve this equation, we need to stipulate some boundary conditions. It’s simplest to 
impose periodic boundary conditions, extending n € Z and requiring unin = Un. For 
N > 1, which is our interest, other boundary conditions do not qualitatively change 
the physics. We can then write the solution to (4.2) as 


Un = A eiet ikna (4.3) 
Because the equation is linear, we can always take real and imaginary parts of this 


solution. Moreover, the linearity ensures that the overall amplitude A will remain 
arbitrary. 


The properties of the lattice put restrictions on the allowed values of k. First note 
that the solution is invariant under k + k + 2r /a. This means that we can restrict k 
to lie in the first Brillouin zone, 


re LED 
a a 
Next, the periodic boundary conditions uy+1 = u1 require that k takes values 
2T N N 
k = —l ith l= ——,...,— 
Na ~ gD 


where, to make life somewhat easier, we will assume that N is even so / is an integer. 
We see that, as in previous sections, the short distance structure of the lattice deter- 
mines the range of k. Meanwhile, the macroscopic size of the lattice determines the 
short distance structure of k. This, of course, is the essence of the Fourier transform. 
Before we proceed, it’s worth mentioning that the minimum wavenumber k = 27/Na 
was something that we required when discussing the Debye model of phonons in the 
Statistical Physics lectures. 


Our final task is to determine the frequency w in terms of k. Substituting the ansatz 
into the formula (4.2), we have 


We find the dispersion relation 


PEN = sin ($) 
m 2 


This dispersion relation is sketched Figure 63, with k ranging over the first Brillouin 


zone. 
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Figure 63: Phonon dispersion relation for a monatomic chain. 


Many aspects of the above discussion are familiar from the discussion of electrons in 
the tight-binding model. In both cases, we end up with a dispersion relation over the 
Brillouin zone. But there are some important differences. In particular, at small values 
of k, the dispersion relation for phonons is linear 


pea i 
m 


This is in contrast to the electron propagation where we get the dispersion relation for 
a non-relativistic, massive particle (2.6). Instead, the dispersion relation for phonons is 
more reminiscent of the massless, relativistic dispersion relation for light. For phonons, 


c, = [>e (4.4) 


This is the speed of sound in the material. 


the ripples travel with speed 


4.1.2 A Diatomic Chain 


Consider now a linear chain of atoms, consisting of alternating atoms of different types. 


The atoms on even sites have mass m; those on odd sites have mass M. For simplicity, 
we'll take the restoring forces between these atoms to be the same. The equations of 
motion are 


Mün = —X(2u2n — U2n—1 7 U2n+1) 


Man+1 = —A(2u2n41 — U2n — Uon+2) 


== 
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Figure 64: Phonon dispersion relation for a diatomic chain. 


We make the ansatz 


—iwt—2ikna B e wet 2ikna 


Um = Ae and Um+1 = 


Note that these solutions are now invariant under k > k +7/a. This reflects the fact 
that, if we take the identity of the atoms into account, the periodicity of the lattice is 
doubled. Correspondingly, the Brillouin zone is halved and k now lies in the range 
T T 
ke |---| (4.5) 
Plugging our ansatz into the two equations of motion, we find a relation between the 
two amplitudes A and B, 


#(* ‘) e -a 2 —_ (*) de 
0M/)\B — (1 + e2") 2 B 


This is viewed as an eigenvalue equation. The frequency w is determined in terms of 
the wavenumber k by requiring that the appropriate determinant vanishes. This time 
we find that there are two frequencies for each wavevector, given by 


2 _ A = 2 2 
wh = [m+ M+ Vm M)? + 4mM cos (ka)| 


The resulting dispersion relation is sketched in Figure 64 in the first Brillouin zone 
(4.5). Note that there is a gap in the spectrum on the boundary of the Brillouin zone, 
k = +1/2a, given by 


AE = h(wy — o) = WB | - 


For m = M, the gap closes, and we reproduce the previous dispersion relation, now 
plotted on half the original Brillouin zone. 


== 


The lower w_ part of the dispersion relation is called the acoustic branch. The upper 
w4, part is called the optical branch. To understand where these names come from, we 
need to look a little more closely at the the physical origin of these two branches. This 
comes from studying the eigenvectors of (4.6) which tells us the relative amplitudes of 
the two types of atoms. 


This is simplest to do in the limit k — 0. In this limit the acoustic branch has w_ = 0 
and is associated to the eigenvector 


The atoms move in phase in the acoustic branch. Meanwhile, in the optical branch we 
have w? = 2\(M~'+ m7") with eigenvector 


In the optical branch, the atoms move out of phase. 


Now we can explain the name. Often in a lattice, different sites contain ions of 
alternating charges: say, + on even sites and — on odd sites. But alternating charges 
oscillating out of phase create an electric dipole of frequency w,(k). This means that 
these vibrations of the lattice can emit or absorb light. This is the reason they are 
called “optical” phonons. 


Although our discussion has been restricted to 
one-dimensional lattices, the same basic characteri- 
sation of phonon branches occurs for higher dimen- 
sional lattices. Acoustic branches have linear disper- 
sion w ~ k for low momenta, while optical branches 


© NO? rad seo 


have non-vanishing frequency, typically higher than 
the acoustic branch. The data for the phonon spec- 
trum of NaCl is shown on the right® and clearly ex- 
hibits these features. 


Figure 65: 


4.1.3 Peierls Transition 


We now throw in two separate ingredients: we will consider the band structure of 
electrons, but also allow the underlying atoms to move. There is something rather 
special and surprising that happens for one-dimensional lattices. 


°This was taken from “Phonon Dispersion Relations in NaC? , by G. Raumo, L. Almqvist and R. 
Stedman, Phys Rev. 178 (1969). 
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We consider the simple situation described in Section 4.1.1 where we have a one- 
dimensional lattice with spacing a. Suppose, further, that there is a single electron per 
lattice site. Because of the spin degree of freedom, it results in a half-filled band, as 
explained in Section 2.1. In other words, we have a conductor. 


Consider a distortion of the lattice, in which successive pairs of atoms move closer 
to each other, as shown below. 


i 2a 


Clearly this costs some energy since the atoms move away from their equilibrium 
positions. If each atom moves by an amount ôx, we expect that the total energy cost 
is of order 


Ulattice aes NX(62x)? (4.7) 


What effect does this have on the electrons? The distortion has changed the lattice 
periodicity from a to 2a. This, in turn, will halve the Brillouin zone so the electron 
states are now labeled by 


T T 
ke (aaa) 
More importantly, from the analysis of Section 2.1, we expect that a gap will open up in 
the electron spectrum at the edges of the Brillouin zone, k = +7/2a. In particular, the 
energies of the filled electron states will be pushed down; those of the empty electron 
states will be pushed up, as shown in the Figure 66. The question that we want to ask 
is: what is the energy reduction due to the electrons? In particular, is this more or less 
than the energy Ulattice that it cost to make the distortion in the first place? 


Let’s denote the dispersion relation before the distortion as Eo(k), and the dispersion 
relation after the distortion as F_(k) for |k] € [0,7/2a) and E4 (k) for |k| € [a/2a, T/a). 
The energy cost of the distortion due to the electrons is 


N T/2a 
U aleċtron = —2 =j dk ( Ea(k) _ E_(k)) (4.8) 
Td enfo 


Here the overall minus sign is because the electrons lose energy, the factor of 2 is to 
account for the spin degree of freedom, while the factor of Na/2r is the density of 
states of the electrons. 
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Figure 66: The distortion of the lattice reduces the energy of the Fermi sea of electrons. 


To proceed, we need to get a better handle on Eo(k) and E_(k). Neither are particu- 
larly nice functions. However, for a small distortion, we expect that the band structure 
is changed only in the immediate vicinity of k = 7/2a. Whatever the form of E(k), 
we can always approximate it by a linear function in this region, 


Eo(k) ~ u+vq withq=k—- ~ (4.9) 
where u = Eo(7/2a) and v = OE) /Ok, again evaluated at k = 7/2a. Note that q < 0 
for the filled states, and q > 0 for the unfilled states. 


We can compute £_(k) in this region by the same kind of analysis that we did in 
Section 2.1. Suppose that the distortion opens up a gap A at k = 7/2a. Since there is 
no gap unless there is a distortion of the lattice, we expect that 


Aw ba (4.10) 


(or perhaps 62 to some power). To compute E_(k) in the vicinity of the gap, we can 
use our earlier result (2.16). Adapted to the present context, the energy E close to 
k = 1/2a is given by 


A2 
7 = 


(Eo(n/2a +@)- E) (Eo(7/2a q) E) 0 


Using our linearisation (4.9) of Eo, we can solve this quadratic to find the dispersion 


/ A2 
E (q) =u a uF 


Note that when evaluated at q = 0, we find the gap E, — E_ = A, as expected. The 
filled states sit in the lower branch E_. The energy gained by the electrons (4.8) is 


relation 
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dominated by the regions k = +7/2a. By symmetry, it is the same in both and given 


by 
Na f° | A? 
Ualectron xy -2 f dq (ea + a + =) 
T JN 4 


Here we have introduced a lower cut-off —A on the integral; it will not ultimately be 
important where we take this cut-off, although we will require vA >> A. The integral 
is straightforward to evaluate exactly. However, our interest lies in what happens when 
A is small. In this limit, we have 
eee) 
ne m |l6v 8v 4vA 


Both terms contribute to the decrease in energy of the electrons. The first term is of 


order A? and hence, through (4.10), of order 6x”. This competes with the energy cost 
from the lattice distortion (4.7), but there is no guarantee that it is either bigger or 
smaller. The second term with the log is more interesting. For small A, this always 
beats the quadratic cost of the lattice distortion (4.7). 


We reach a surprising conclusion: a half-filled 


T TTT 
TTF-TCNQ 4 


band in one-dimension is unstable. The lattice rear- #0602 - 


ranges itself to turn the metal into an insulator. This 
is known as the Peierls transition; it is an example of 
a metal-insulator transition. This striking behaviour 


Pp (xem) 


can be seen in one-dimensional polymer chains, such 


as the catchily named TTF-TCNQ shown in the fig- 


ure’. The resistivity — plotted on the vertical axis 


— rises sharply when the temperature drops to the EL ee 
scale A. (The figure also reveals another feature: T(K) 

as the pressure is increased, the resistivity no longer 

: : ; Figure 67: 

rises quite as sharply, and by the time you get to 

8 GPa there is no rise at all. This is because of the 


interactions between electrons become important.) 


4.1.4 Quantum Vibrations 


Our discussion so far has treated the phonons purely classically. Now we turn to 
their quantisation. At heart this is not difficult — after all, we just have a bunch of 
harmonic oscillators. However, they are coupled in an interesting way and the trick 
is to disentangle them. It turns out that we’ve already achieved this disentangling by 
writing down the classical solutions. 


"This data is taken from “Recent progress in high-pressure studies on organic conductors’, by S. 
Yasuzuka and K. Murata (2009) 
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We have a classical solution (4.3) for each kı = 2n1/Na with l = —N/2,...,.N/2. 
We will call the corresponding frequency w, = 2,/A/m| sin(kja/2)|. We can introduce a 
different amplitude for each l. The most general classical solution then takes the form 


Un(t) = Xo(t) + > o eine) q at etrt—kina) (4.11) 
120 
This requires some explanation. First, we sum over all modes l = —N/2,...,+N/2 


with the exception of l = 0. This has been singled out and written as Xo(t). It is 
the centre of mass, reflecting the fact that the entire lattice can move as one. The 
amplitudes for each | Æ 0 mode are denoted a;. Finally, we have taken the real part 
of the solution because, ultimately, u,,(t) should be real. Note that we’ve denoted the 
complex conjugation by al rather than aj in anticipation of the quantisation that we 
will turn to shortly. 


The momentum p,,(t) = mt, is given by 
Dn(t) = P(t) + oy [imwa get hime) L imujat eilwit—kina) 
140 


Now we turn to the quantum theory. We promote un and p, to operators acting on a 
Hilbert space. We should think of u,(t) and p,(t) as operators in the Heisenberg rep- 
resentation; we can get the corresponding operators in the Schrodinger representation 
simply by setting t = 0. 


Since u, and p, are operators, the amplitudes a; and al must also be operators if 
we want these equations to continue to make sense. We can invert the equations above 
by setting t = 0 and looking at 


N 
Y ie etkina = > lav etki — ky )na ne al a = N(qy i at) 
n=1 


n Uv 


Similarly, 


N 
X Pn etkina — X X [-imwvor et(ki—ky na ote imuyat, geia — —iNmw;(oy Sa at) 
n=1 


n Uv 


where we’ve used the fact that w, = w_;. We can invert these equations to find 


1 . 
a = zN eM (murun + ipn) 
1 tkyna . 
a} = ImaN 2 eh (mwun — ipn) (4.12) 


=I] = 


Similarly, we can write the centre of mass coordinates — which are also now operators 
— as 


1 1 
Xo = Noa and Py = Noa (4.13) 


At this point, we’re ready to turn to the commutation relations. The position and 
momentum of each atom satisfy 


lun, Ded = iħôn n 


A short calculation using the expressions above reveals that Xo and P) obey the rela- 
tions 
ih 
[Xo, Po] = N 


Meanwhile, the amplitudes obey the commutation relations 


[an a}] = ImaN and [ay, av] = [af a})] = 0 


This is something that we’ve seen before: they are simply the creation and annihilation 
operators of a simple harmonic oscillator. We rescale 


/ h 
= 4.14 
ii 2mwı N a ( ) 


mma = dy and laa] = olal] = 0 


then our new operators a; obey 


Phonons 


We now turn to the Hamiltonian (4.1). Substituting in our expressions (4.12) and 
(4.13), and after a bit of tedious algebra, we find the Hamiltonian 


eae X (ajar + Via 
= — a — 
2M Z u 


Here M = Nm is the mass of the entire lattice. Since this is a macroscopically large 
object, we set Py = 0 and focus on the Hilbert space arising from the creation operators 
al. After our manipulations, these are simply N, decoupled harmonic oscillators. 
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The ground state of the system is a state |0) obeying 
a,|0) =0 V1 


Each harmonic oscillator gives a contribution of hw;/2 to the zero-point energy Eo of 
the ground state. However, this is of no interest. All we care about is the energy 
difference between excited states and the ground state. For this reason, it’s common 
practice to redefine the Hamiltonian to be simply 


H = ` hwala 
140 


so that H|0) = 0. 


The excited states of the lattice are identical to the excited states of the harmonic 
oscillators. For each l, the first excited state is given by a10) and has energy E = hu). 
However, although the mathematics is identical to that of the harmonic oscillator, the 
physical interpretation of this state is rather different. That’s because it has a further 
quantum number associated to it: this state carries crystal momentum Ak;. But an 
object which carries both energy and momentum is what we call a particle! In this 
case, it’s a particle which, like all momentum eigenstates, is not localised in space. This 
particle is a quantum of the lattice vibration. It is called the phonon. 


Note that the coupling between the atoms has lead to a quantitative change in the 
physics. If there was no coupling between atoms, each would oscillate with frequency 
mA and the minimum energy required to excite the system would be ~ hm. However, 
when the atoms are coupled together, the normal modes now vibrate with frequencies 
uw. For small k, these are w, ~% ant. 
of 1/N. In the limit of an infinite lattice, N — oo, there are excited states with 
infinitesimally small energies. We say that the system is gapless, meaning that there 
is no gap betwen the ground state and first excited state. In general, the question of 


The key thing to notice here is the factor 


whether a bunch interacting particles is gapped or gapless is one of the most basic (and, 
sometimes, most subtle) questions that you can ask about a system. 


Any state in the Hilbert space can be written in the form 


-m am 
lv) =] ]f Frat |) 


l 


and has energy 


H(p) = >D ħnwi 
l 
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This state should be thought of as described $`; n; phonons and decomposes into nı 
phonons with momentum Ak; for each l. The full Hilbert space constructed in this way 
contains states consisting of an arbitrary number of particles. It is referred to as a Fock 
space. 

Because the creation operators al commute with each other, there is no difference 
between the state |) ~ alal |0} and |w) ~ al,al|0). This is the statement that phonons 
are bosons. 


The idea that harmonic oscillator creation operators actually create particles some- 
times goes by the terrible name of second quantisation. It is misleading — nothing has 
been quantised twice. 


Quantisation of Acoustic and Optical Phonons 


It is not difficult to adapt the discussion above to vibrations of a diatomic lattice that 
we met in Section 4.1.2. We introduce two polarization vectors, ex(k). These are 


eigenvectors obeying the matrix equation (4.6), 


2 —(1+ e7ke) _ we m 0 o 
ed ee ae 


We then write the general solution as 


( Urn (t) = y `. ANS atesten + alaere torn 


Uan+i(t) keBZ s=+ 


where the creation operators obey 


[as(k), as (k')'] = bs.55ie and [as(k), as (k')] = [af (k), as (k')'] = 0 


f 


Now the operators a' (k) create acoustic phonons while a', (k) create optical phonons, 


each with momentum ñk. 


4.1.5 The Mossbauer Effect 


There’s a rather nice application of phonons that goes by the name of the Mossbauer 
effect. This is to do with how nuclei in solids absorb gamma rays. 
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To understand this, we first need to think about atoms absorb light, and then contrast 
this with how nuclei absorb light. To this end, consider a gas of atoms, all sitting in 
the ground state. If we shine light on the atoms at very specific frequencies, then the 
atoms will absorb the light by jumping to excited states. The frequency should be 


Ey = ħv = Eexcite 


where Eexcite is the energy difference between the excited state and the ground state. 
Once the atom absorbs a photon, it will sit in the excited state for some time and then 
decay. If it drops back down to the ground state, the emitted photon will again have 
energy E, and can be absorbed by another atom. This then repeats, a process known 
as resonant absoption. 


However, a little thought shows that the situation is slightly more complicated than 
we’ve made out. Suppose, for simplicity, that the original atom was at rest. In the col- 
lision with the atom, both energy and momentum must be conserved. The momentum 
of the incoming photon is p, = E,/c and, after the collision, this is transferred to the 
atom, SO Patom = E,/c. This means that the atom has kinetic energy from the recoil, 


(gre = b 
2M 2M c? 


Bais (4.15) 
where M is the mass of the atom. (The speed of the atom is small enough that we 
can use the non-relativistic form of kinetic energy.) So we see that it’s not quite right 
to say that the energy of the photon should be tuned to the energy difference Fexcite 
because this ignores the energy that goes into the recoil. Instead, the incoming photon 
should have slightly higher energy, Ey = Excite + Evecoil, Or 


Es (Erexcite)” 
E; = Eexcite t zy > Ey © Eexcite + aoe t... 4.16 
7 x toMa x tet 2Mc? P ( ) 
Meanwhile, when the atom now decays back to the ground state, it will emit the photon 
in a random direction. This means that the atom typically remains in motion; indeed, 
it’s quite possible that the kinetic energy of atom increases yet again if it emits the 
photon back in the direction it came. All of this means that the energy of the emitted 


photon that the atom emits is smaller than the energy of the photon that it absorbed. 


The question is: what happens next? In particular, is it possible for this emitted 
photon to be re-absorbed by a different atom so that we get resonant absorption? This 
is now a quantitative question, rather than a qualitative question. The key point is 
that you don’t need to tune the frequency of light exactly to Excite in order to excite 
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an atom. Instead, there is a range of energies — a so-called line width — that will do the 
job. This line width is related to the lifetime 7 of the excited state by AE ~ h/r. (See 
the chapter on scattering in the lectures on Topics in Quantum Mechanics for more 
details.) 


Let’s put in some numbers. The energy needed to excite an electron from one level 
to another is measured in Excite & eV. Meanwhile the mass of, say, an iron atom is 
around Mc? ~ 5 x 10* MeV. This means that the correction term (4.16) in the photon 
energy is of order AE, ~ 107" eV. This is significantly smaller than the line width 
of atomic excitations, and the discussion above has no relevance to absorption of light 
due to transitions of electrons from one energy level to another. 


However, things are very different when it comes to nuclear transitions. Now the 
relevant excitation energy is of order Excite ~ 104 eV, corresponding to soft gamma 
rays, and the correction term (4.16) in the photon energy due to recoil effects is AF = 
10-3 eV. This time the energy is significantly larger than the line width: a typical 
nuclear excitation has lifetime 7r ~ 1077 seconds and a width F ~ 1078 eV. The upshot 
of this argument is that, while X-ray absorption lines are seen corresponding to atomic 
excitations, we should not expect to see a repeat in the gamma-ray spectrum associated 
to nuclear excitations. 


And yet.... while it’s true that gamma ray resonant absorption lines are not seen in 
gasses, they are seen solids. This is the Mossbauer effect. The important point is that 
a nucleus in an atom is coupled to all the other atoms through the bonds in a solid. A 
nucleus will recoil when hit by a photon, as in the discussion above, but now the atom 
will bounce back into position and the energy Eyeco will typically be distributed into 
phonon degrees of freedom. When there are a large number of phonons excited, the 
story is not different from that told above, and the emitted photon has a sufficiently 
different frequency to kill resonant absorption. However, there is some probability that 
no phonons are created, but instead the entire solid moves absorbs the momentum of 
the photon. In this case, the recoil energy is still given by (4.15) but with M is the 
mass of the solid, rather than the mass of a single atom. This gives an extra factor 
of around 1073 in the denominator, and the recoil energy becomes negligible. For this 
to happen, the entire solid must react coherently as a single quantum object! The 
resulting gamma ray resonant absorption spectrum is indeed observed. 
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4.2 From Atoms to Fields 


If we look at a solid at suitably macroscopic distances, we don’t notice the underlying 
atomic structure. Nonetheless, it’s still straightforward to detect sound waves. This 
suggests that we should be able to formulate a continuum description of the solid that 
is ignorant of the underlying atomic make-up. 


With this in mind, we define the displacement field for a one-dimensional lattice. 
This is a function u(x,t). It is initially defined only at the lattice points 


u(x = na) = un 


However, we then extend this field to all x € R, with the proviso that our theory will 
cease to make sense if u(x) varies appreciably on scales smaller than a. 


The equation governing the atomic displacements is (4.2) 
Mün = —A(2Un — Un—1 — Un+1) 


In the continuum limit, this difference equation becomes the wave equation 


Ou Ou 
Pas = À a? (4.17) 


where p = m/a is the density of our one-dimensional solid, and \’ = Aa. These are 


the macroscopic parameters. Note, in particular, that the speed of sound (4.4) can be 
written purely in terms of these macroscopic parameters, c = \'/p. 


The equation of motion (4.17) can be derived from the action 


s= fom BGY 2) 


This is the field theory for the phonons of a one-dimensional solid. 


4.2.1 Phonons in Three Dimensions 


For three-dimensional solids, there are three displacement fields, u;(x), one for each 
direction in which the lattice can deform. In general, the resulting action can depend 
on various quantities Ou;/Ozx4. However, if the underlying lattice is such that the long- 
wavelength dynamics is rotationally invariant, then the action can only be a function 


IT" 2\ dri Axi 


of the symmetric combination 
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If we want an equation of motion linear in the displacement, then the most general 
action is a function of UijUij or ae (The term upk is a total derivative and does not 
affect the equation of motion). We have 


1 N 
S= [tate 5 = e (Z) = 2U UijUij — À UiUjj (4.18) 


The coefficients u and À are called Lamé coeffcients; they characterise the underlying 
solid. 


This action gives rise to the equations of motion 


u; i Pu; i u; 
Pop — H+ Nasa "apa7 (S1 


We can look for solutions of the form 


u(x, t) = ç; elk x+ut) 


where e; determines the polarisation of the wave. Plugging this ansatz into the equation 
of motion gives us the relation 


pwe, = uk?ei + (u + A)(e- kk; 


The frequency of the wave depends on the polarisation. There are two different options. 
Longitudinal waves have k ~ e. These have dispersion 


2 À 
pay (4.20) 
p 
Meanwhile, transverse waves have e- k = 0 and dispersion 
wt = Ék? (4.21) 
p 


Note that both of these dispersion relations are linear. The continuum approximation 
only captures the low-k limit of the full lattice system and does not see the bending 
of the dispersion relation close to the edge of the Brillouin zone. This is because it is 
valid only at long wavelengths, ka < 1. 


The general solution to (4.19) is then 


ak 1 , . 
(x,t) =a) Ga 3 Dpw,(k) e (ask) ellk-x—wst) 4 gt (Ik) D (4.22) 


where the s sum is over the three polarisation vectors, two transverse and one longi- 
tudinal. The frequencies ws(k) correspond to either (4.20) or (4.21) depending on the 
choice of s. 
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4.2.2 From Fields to Phonons 


Although we have discarded the underlying atoms, this does not mean that we have 
lost the discrete nature of phonons. To recover them, we must quantise the field theory 
defined by the action (4.18). This is the subject of Quantum Field Theory. You will 
learn much (much) more about this in next year’s lectures. What follows is merely a 
brief taster for things to come. 


To quantise the field, we need only follow the same path that we took in Section 
4.1.4. At every step, we simply replace the discrete index n with the continuous index 
x. Note, in particular, that x is not a dynamical variable in field theory; it is simply a 
label. 


First, we turn the field u(x) into an operator. This means that the amplitudes 
as(k) and al (k) in (4.22) also become operators. To proceed, we need the momentum 
conjugate to u;(x,t). This too is now a field, and is determined by the usual rules of 
classical dynamics, 


Written in terms of the solution (4.22), we have 


dk 1 
(x,t) = Do smo G (- iwsas(k) #08 +4 iw at (k) =) 


The canonical commutation relations are the field-theoretical analog of the usual position- 
momentum commutation relations, 


[ty (x), Tj (x)| =iħ bij O° (x — x’) 


At this point we have some straightforward but tedious calculations ahead of us. We 
will skip these on the grounds that you will see them in glorious detail in later courses. 
The first is an inverse Fourier transform, which expresses a,(k) and al(k) in terms of 
ui(x) and 7;(x). The result is analogous to (4.12). We then use this to determine the 
commutation relations, 


las(k), a} (k')] = dss (k-k) and [as(k), as (k')] = [al(k), a}, (k’)] = 0 


This is the statement that these are creation and annihilation operators for harmonic 
oscillators, now labelled by both a discrete polarisation index s = 1,2,3 as well as the 
continuous momentum index k. 
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The next fairly tedious calculation is the Hamiltonian. This too follows from standard 
rules of classical dynamics, together with a bunch of Fourier transforms. When the dust 
settles, we find that, up to an irrelevant overall constant, 


n= > f a huog(k)al (Ik) a (ke) 


This is simply the Hamiltonian for an infinite number of harmonic oscillators. 


The interpretation is the same as we saw in Section 4.1.4. We define the ground 
state of the field theory to obey a,(k)|0) = 0 for all s and for all k. The Fourier modes 
of the field al(k) are then to be viewed as creating and destroying phonons which 
carry momentum hk, polarisation €, and energy hw,(k). In this way, we see particles 
emerging from an underlying field. 


Lessons for the Future 


This has been a very quick pass through some basic quantum field theory, applied to 
the vibrations of the lattice. Buried within the mathematics of this section are two, 
key physical ideas. The first is that a coarse grained description of atomic vibrations 
can be described in terms of a continuous field. The second is that quantisation of the 
field results in particles that, in the present context, we call phonons. 


There is a very important lesson to take from the second of these ideas, a lesson 
which extends well beyond the study of solids. All of the fundamental particles that 
we know of in Nature — whether electrons, quarks, photons, or anything else — arise 
from the quantisation of an underlying field. This is entirely analogous to the way that 
phonons arose in the discussion above. 


Is there also a lesson to take away from the first idea above? Could it be that the 
fundamental fields of Nature themselves arise from coarse-graining something smaller? 
The honest answer is that we don’t know. However, perhaps surprisingly, all signs point 
towards this not being the case. First, and most importantly, there is no experimental 
evidence that the fundamental fields in our Universe have a discrete underpinning. But 
at the theoretical level, there are some deep mathematical reasons — to do with chiral 
fermions and topology — which suggest that it is not possible to find a discrete system 
from which the known laws of physics emerge. It would appear that our Universe 
does not have something akin to the atomic lattice which underlies the phonon field. 
Understanding these issues remains a vibrant topic of research, both in condensed 
matter physics and in high energy physics. 
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